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ABSTRACT 

This publication reports the results of an effort to 
assess the consequences of targeting federal compensatory funds 
toward all elementary students showing low achievement rather than 
only to those who were first low income and then low achieving. The 
document deals first with technical issues--the suitability, 
feasibility, and cost implications of the use of performance 
criteria. It then considers the fiscal issue of the consequences of 
resource redistribution from the use of performance criteria. The 
report's overall ronclusion is that serious but not insurmountable 
political, technical, and cost problems would exist in the creation 
of the required data base. However, should these problems be solved, 
the report argues that a ootentially appealing combination of iocome 
and performance^ criteria could be created at a 25 percent increase in 
program cost. Additionally, it reports that by allowing a choice 
between incojie/AFDC and performance criteria, the program could be 
designed to prevent any district from losing monies while at the same 
time aiding those nonpoverty students who are educationally 
disadvantpiged. (Author/DN) 
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POLICY RESEARCH REPORT 



A Policy Research Report is an omcial document of the Center for the 
Study of Social Policy. It presents results of work directed toward specific 
research objectives. The report is a comprehensive treatment of the ob* 
jectives, scope, methodology, data, analyses^ and conclusions, and presents 
the background, practical significance, and technical information required 
for a complete and full understanding of the research activity. The report 
is designed to be directly useful to social policy makers. 

RESEARCH MEMORANDUM 

A Research Memorandum is a working paper that presents the results of work 
in progress. The purpose of the Research Memorandum is to invite comment 
on research in progress. It is a comprehensive treatment of a single research 
area or of a facet of a research area within a larger field of study. The Memo* 
randum presents the background, objectives, scope, summary and conclu* 
sions, as well as method and approach, in a condensed form. Since it presents 
views and conclusions drawn during the progress of research activity, it may 
be expanded or modified in the light of further research. 

RESEARCH NOTE 

A Research Note is a working paper that presents the results of study related 
to a single phase or factor of a research problem. It also may present pre* 
liminary exploration of a social policy issue or an interim report which may 
later appear as a larger study. The purpose of the Research Note is to 
instigate discussion and criticism. It presents the concepts, findings, and/or 
conclusions of the author. It may be altered, expanded, or withdrawn 
at any time. 



PREFACE 



The Use of Performance Criteria to Allocate Compensatory Education 
Funds Is a two-part study by SRPs Educational Policy Research Center. 
It represents a broadly based effort to assess the consequences that 
would occur If federal compensatory funds were targeted toward all ele- 
mentary students showing low achlever.ent rather than only to those who 
were first low Income and then low achieving. 

Such a shift was brought to national attention In 1973 by H. R. A163, 
authored by Representative Albert Qule. The billys specific proposr4ls 
were not adoptrJ by the 93rd Congress, but the question of compensatory 
education for the primarily economically disadvantaged versus the edu- 
cationally disadvantaged Is almost certain of continued national debate 
relating to educational goals and to techn5.cal and fiscal problems. 
These two companion analyses have been prepared as substantive evidence 
for the technical and fiscal discussions and as support Information for 
the broader discussions of national education goals. The Summary that 
follows covers both parts. 

Part I.* ''Technical Issues; Suitability » Feasibility ^ and 

Cost Implications of the Use of Performance Criteria/' 
by John A. Emrlck. 

Part II: "Fiscal Issues; Resource Redistribution Consequences 
of the Use of Performance Criteria/' by James W. 
Guthrie I Anne S. Frentz^ and Rita M. Mlze. 
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SUMMARY 



Currently ESEA Title I monies are allocated to the states and coun- 
ties based upon income data from the Census and APDC data from the wel- 
fare roles. These monies then pass on to school districts and to 
particular schools within each district based upon similar income and 
welfare considerations. Finally, within the individual school, monies 
are targeted on the most educationally disadvantaged students regarr;iess 
of their family's income level or welfare status. Nevertheless, the 
result of this process is that the majority of Title I money is spent 
on students from low income and welfare families. This is consistent 
with the intent of the basic legislation which was part of a series of 
laws designed to overcome the effects of poverty. 

An alternative to the above procedure would be to focus money 
directly upon the most educationally disadvantaged. Since school per- 
formance is only moderately correlated with poverty— i ,e., not all poor 
children perform poorly, nor do all non-poor children perform well— this 
proposal would of necessity allocate more money to children not in poverty. 
Such a proposal is found in H.R. 5163. 

Ultimately, the choice of allocation criterion flows from Congress' 
sense of priorities and values at a particular time. To inform this 
choice we have prepared papers to aid in answering two questions. First, 
what are the implications of creating a performance data base that could 
be used to allocate monies to low-achieving students? Second, what would 
be the effect of such changes in the flow of Title I monies if performance 
criteria were used? 



Our overall conclusion is that serious but not insurmountable 
political^ technical, and cost problems exist in the creation of the 
required data base. If these problems can be solved, then a potentially 
appealing combination of income and performance criteria could be created 
at a moderate (25 percent) increase in program cost* By allowing a choice 
between income/AFDC and performance criteria, the program could be de- 
signed so as not to cause any district to lose monies while at the same 
time aiding those non-poverty students who are educationally disadvantaged* 

We therefore recommend that either NIE or a similar organization, 
working in a participatory mode with the educational community, analyze 
the range of data base problems. This analysis should include but not 
be limited to the technical problems of test and measurement, and should 
make recommendations within a year on the political, technical, and 
fiscal alternatives and on plans to implement such alternatives. 

Among our specific findings are the following. 

Creation of Performance Data Base 

If one assumes an intent to identify specific children with an 
educational deficit, presumably in basic skills, and allocate monies 
directly for them, then: 

• Variations in curriculum across states and to a lesser extent, 
among school districts within a state, raise complex questions 
regarding the practicality and meaning of any national measure 
of educational disadvantage at a given grade level, 

• To the extent that national measures of educational disadvantage- 
ment are created, they will also imply a national curriculum, 
i,e,, a consensus about what should be learned by a particular 
grade level. Such a consensus may not be desired by either the 
Congress or the states, 

• With the present state of the ert of test construction, and the 
variability of test performance for a given child on a given day, 
numerous "incorrect'* inclusions and exclusions to the program 



would result • The percentage of students "misclasslf led** 
might not be large but the absolute number would be« 

• The total cost of administering the necessary tests to stu- 
dents could approach $250 million^ an appreciable fraction 
of all Title I monies. States and school districts could 
obtain other benefits from the test information, e.g., a few 
states such ao California already test every child because it 
is felt that such information is already worth the cost« How- 
ever, we could not estimate the value of the benefits that 
would flow to states and local districts from a national 
testing program. 

• There could well be differential effectiveness resulting 
from monies allocated under a performance criterion. It 
appears possible, though far from certain, that the use of 
a performance criterion might somewhat lower the effective- 
ness of the program per dollar spent, e.g., more children 
receiving assistance might already be operating at their 
capability level under the performance criteria than under 
the poverty/welfare criteria. 

• If a performance criterion wero used only to allocate 
monies to the school district level, this would con- 
siderably reduce the test and measurement problem and the 
cost of the data. However, this brings up the questions 
of what is then to be done with the monies once inside 
the school district, and what allocation criterion is to 
prevail there. 

Changes in Allocation 

Using existing results from state testing programs, it was possible 
to estimate the changes in allocation of monies between states (for a 
14-state sample) and within states (for a 7-state sample) which would 
result from the use of alternative criteria. Rather than dollar amounts, 
the comparisons were based upon changes in the number of eligible students 
under various alternative criteria. 

The analyses used definitions of educational disadvantage which ranged 
from below the 10th percentile to below the 35th percentile on the tests. 
However, the pattern of results was not sensitive to the percentile 
chosen and the results described in this summary are for students below 
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the 15th percentile unless otherwise stated • Further analyses by school 
district and student characteristics were then made for California and 
Alabama. Finally, analyses were made of the effect and cost of a combined 
criterion that used the higher of either the income/AFDC or a performance 
criterion. 



Within States 

• Within states, the districts with currently low concentrations 
of Title Z eligible students would gain funds, and those with 
high concentrations would lose funds. 

• Within California a further refinement of the above results 
shows that cities of populations greater than 200,000 would 
?.ose funds while smaller cities, suburbs, and rural areas all 
would gain monies. 

Los Angeles County would lose 6 percent of its overall funds. 
However, if the county is divided into highly urbanized areas — 
Los Angeles City and Long Beach-- and the rest of the county, 
we find that Los Angeles and Long Beach would lose 16 percent 
and 32 percent respectively while the rest of the county would 
gain 15 percent. 

No California district with over 30 percent minority enrollment 
would gain money. Separating out black students, no district 
with over 10 percent black students in its enrollment would grin 
funds . 

• The minority analysis for Alabama contradicts the above results 
in that the integrated districts (20 percent to 50 percent black 
enrollment) would gain funds whiJo districts both above and 
below this mid-range would lose money. Thus it was the integrated 
districts that benefited. 



Among States 

• Out of the sample of 14 states, the five Southern states (Alabama, 
Arkansas, Mississippi, North Carolina, and Tennessee) showed 
sizable gains, from 65 percent to 125 percent, when a performance 
criterion was used for allocation. Two New England states. New 
Hampshire and Rhode Island, also showed moderate gains. The 
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pattern of change for the rest of the states varied: Arizona^ 
-9 percent; California, +4 percent; Hawaii, 0 percent; Iowa, 
-54 percent; New Mexico, +57 percent; North Dakota, -5 percent. 

Current Title I Versus a Performance Criterion 

• A formula that would allow a school district to choose either 
the current Title I criterion or a performance criterion to 
calculate its Title I entitlement would aid educationally dis- 
advantaged children of working and middle income families while 
not reducing funds to the children of poverty families. The 
broad question we addressed here was: what would be the increase 
in costs if allocations to current students were maintained and 
additional students were funded at the same cost level. 

Based upon an analysis of sir: states, the average increase in 
cost appears to be about 25 percent, with the probable exception 
of southern states. For Alabama, the one southern state In the 
sample, the increase would be 165 percent. As previously mentioned, 
a performance criterion would mainly aid southern states relative 
to other regions, 

• The poverty versus performance criteria effects were al^o analyzed 
with a restriction assumed—that a district could only use the 
performance criteria if it had more than 16 percent of its stu- 
dents below the sixteenth percentile — in the belief that each 
district had some inherent capacity to effectively help small 
numbers of educationally disadvantaged children » The rasulting 
reduction in cost was quite modest for most states ar. ' negligible 
for the southern states, 

• An analysis was performed in which the poverty popul:moi; anJ 
the academically-deficient population were combined, T4r/ 
rationale behind this was that the needs of each population 
were, to a degree, distinct. Thus double-counting would 
enable the needs of both the poverty population (auxiliary, 
supportive health services, for example) and the academi- 
cally deficient population to be targeted and met more 
effectively. 

Under this arrangement the increase in funding Is approximately 
25 percent. 
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D-62 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: New Mexico 11-221 

D-63 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: Alabama 11-222 

D-64 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: Alabama 11-223 

D-65 Analysis of Alternative Poverty Criteria 

Family Income Under $4,000: Alabama 11-224 

D-66 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: Alabama 11-225 

D-67 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: Alabama 11-226 

D-68 Ani^lysis of Alternative Poverty Criteria 

Orshansky Poverty Index: California 11-227 

D-69 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: California 11-228 

D-70 Analy&^s of Alternative Poverty Criteria 

Family Income Under $4,000: California 11-229 

D-71 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: California 11-230 



xxiv 



ERLC 



D-72 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: California 11-231 

D-73 Analysis of Alternative Poverty Criteria 

Qrshansky Poverty Index: Delaware 11-232 

D-74 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: Delaware 11-233 

D-75 Analysis of Alternative Poverty Criteria 

Family Income Under $4,000: Delaware . 11-234 

D-76 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: Delaware 11-235 

D-77 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: Delaware 11-236 

D-78 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: Iowa 11-237 

D-79 Analysis of Alternative Poverty Criteria 

Family Income Under $3^000: Iowa 11-238 

D-80 Analysis of Alternative Poverty Criteria 

Family Income Under $4^000: Iowa 11-239 

D-81 Analysis of Alternative Poverty Criteria 

Family Incrxne Under $5^000: Iowa 11-240 

D-82 Analysis of Alternative Poverty Criteria 

Family Income Under $6^000: Iowa 11-241 

D-83 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: Massachusetts 11-242 

D-84 Analysis of Alternative Poverty Cx^itesvt<a 

Family Income Under $3^000: Massachusetts 11-243 

D-85 Analysis of Alternative Poverty Criteria 

Family Income Under $4,000: Massachusetts 11-244 

D-86 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: Massachusetts ......... 11-245 

D-87 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: Massachusetts 11-246 

D-88 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: New Mexico 11-247 

D-89 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: New Mexico 11-248 
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D-90 Analysis of Alternative Poverty Criteria 

Family Income Under $4,000: New Mexico 11-249 

D-91 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: New Mexico 11-250 

D-92 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: New Mexico 11-251 

D-93 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: Alabama 11-252 

D-94 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: Alabama 11-253 

D-95 Analysis of Alternative Poverty Criteria 

Family Income Under $4,000: Alabama 11-254 

D-96 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: Alabama 11-255 

D-97 Analysis of Alternritlve Poverty Criteria 

Family Income Under $6,000: Alabama 11-256 

D-98 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: California 11-257 

D-99 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: California 11-258 

D-lOO Analysis of Alternative Poverty Criteria 

Family Income Under $4,000: California 11-259 

D-101 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: California 11-260 

D-102 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: California 11-261 

D-103 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: Delaware • « . . 11-262 

D-104 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: Delaware 11-263 

D-105 Analysis of Alternative Poverty Criteria 

Family Income Under $4,000: Delaware ...... 11-264 

D-106 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: Delaware 11-265 

D-107 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: Delaware 11-266 
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D-108 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: Iowa 11-267 

D-109 Analysis of Alternative Poverty Criteria 

Family Income Under $3^000: Iowa 11-268 

D-110 Analysis of Alternative Poverty Criteria 

Family Income Under $4|000: Iowa 11-269 

D-111 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: Iowa 11-270 

D-112 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: Iowa 11-271 

D-113 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: Massachusetts 11-272 

D-114 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000: Massachusetts 11-273 

D-115 Analysis of Alternative Poverty Criteria 

Family Income Under $4,000: Massachusetts 11-274 

D-116 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000: Massachusetts 11-275 

D-117 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000: Massachusetts 11-276 

D-118 Analysis of Alternative Poverty Criteria 

Orshansky Poverty Formula: New Mexico 11-277 

D-119 Analysis of Alternative Poverty Criteria 

Family Income Under $3,000— New Mexico 11-278 

D-120 Analysis of Alternative Poverty Criteria 

Family Income Under $4, 000- -New Mexico 11-279 

D-121 Analysis of Alternative Poverty Criteria 

Family Income Under $5,000— New Mexico 11-280 

D-122 Analysis of Alternative Poverty Criteria 

Family Income Under $6,000— New Mexico 11-281 

D-123 Analysis of Interstate Shifts In Proportion of Title I 

Children When Current Definition of Poverty Is Contrasted 
With Alternative Criteria 

Rocky Mountains Region , 11-282 

D-124 Analysis of Interstate Shifts In Proportion of Title I 

Children When Current Definition of Poverty Is Contrasted 
With Alternative Criteria 

Great Lakes Region 11-283 
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D-125 Analysis of Interstate Shifts in Proportion of 
Title I Children When Current Definition of 
Poverty is Contrasted With Alternative Criteria 

Mideast Region 11-284 

D-126 Analysis of Interstate Shifts in Proportion of 
Title I Children When Current Definition of 
Poverty is Contrasted With Alternative Criteria 

New England Region 11-285 

D-127 Analysis of Interstate Shifts In Proportion of 
Title I Children When Current Definition of 
Poverty Is Contrasted With Alternative Criteria 

Pacific Region 11-286 

D-128 Analysis of Interstate Shifts In Proportion of Title I 

Children When Current Definition of Poverty is Contrasted 
With Alternative Criteria 

Plains Region , . . . 11-287 

D-129 Analysis of Interstate Shifts In Proportion of Title I 

Children When Current Definition of Poverty Is Contrasted 
With Alternative Criteria 

Southeast Region 11-288 

D-130 Analysis of Interstate Shifts in Proportion of Title I 

Children When Current Definition of Poverty Is Contrasted 
With Alternative Criteria 

So uthwest Region II -289 

D-131 Alabama Double-Count, Simulation I Sum of Present 

Title I Children and Low Achieving Children 11-290 

D-132 California Double-Count, Simulation I Sum of Present 

Title I Children and Low Achieving Children 11-291 

D-133 Delaware Double-Count, Simulation I Sum of Present 

Title I Children and Low Achieving Children ... 11-292 

D-134 Iowa Double-Count, Simulation I Sum of Present Title I 

Children and Low Achieving Children 11-293 

D-135 Massachusetts Double-Count i Simulation I Sum of 

Present Title I Children and Low Achieving Children .... 11-294 

D-136 New Mexico Double-Count, Simulation I Sum of Present 

Title I Children and Low Achieving Children 11-295 

D-137 Alabama Double-Count, Simulation II 

Sum of Orshansky Children and Low Achieving Children . . . 11-296 

D-138 California Double-Count, Simulation II 

Sum of Orshansky Children and Low Achieving Children . . . 11-297 
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D-139 Delaware Double-Count, Simulation II 

Sum of Orshansky Children and Low Achieving Children . . . Ii-298 

D-140 Iowa Double-Count, Simulation I I 

Sum of Orshansky Children and Low Achieving Children . . . 11-299 

D-141 Massachusetts Double-Count, Simulation II 

Sum of Orshansky Children and Low Achieving Children . . . 11-300 
D-142 New Mexico Double-Count, Simulation II 

Sum of Orshansky Children and Low Achieving Children . . . 11-301 

D-143 Alabama Simulation I Analysis by Percentage of 
Children who Currently Qualify as Title I 

Formula Children 11-302 

D-144 Alabama Simulation II Analysis of Shifts by Percentage 
of Children Living In District who Qualify as 

Orshansky Children 11-303 
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I OVERVIEW 



This report studios the implications of an alternative method of al- 
locating and targeting funds for ESEA Title I which aroused considerable 
debate during the past year, particularly in the House of Representatives' 
General Education Subcommittee, Title I is a categorical program of 
financial assistance to targeted local schools and districts which contain 
economically disadvantaged children above some minimal number. The pro- 
gram has been in effect for eight years and involves a current annual 
outlay of $1.5 billion. The principal sources of eligibility data are 
the census (providing income data) and the welfare roles (providing AFDC 
data). In FY 1972, eight million children were targeted for Title I, 
which means the average per pupil outlay was less than $200. But funds 
are not allocated to individual children. Rather, they are allocated to 
Local Education Agencies (LEAs) through State Education Agencies (SEAs) . 
It is often hard to trace program expenditures beyond the level of the 
LEA. 

In February 1973, Minnesota Representative Albert Quie introduced a 
bill (HR 5163) which he argues would vastly improve the effectiveness of 
Title I. Briefly, Quie contends that school performance is only moderately 
correlated with poverty; hence, many children who are performing badly 
but are not from impoverished families are excluded from Title I services. 
His proposal would amend Title I — first, by substituting performance 
criteria for the socio-economic criteria currently used in determining eli- 
gibility; and second, by requiring that no less than 85 percent of the 
program expenditures be spent on the targeted children for provision of 
individually prescribed instruction toward attainment of negotiated ob- 
jectives in the areas of reading and mathematics. Quie's amendment 
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proposes that a national assessment be made In the areas of reading and 
math 9 using criterion-referenced tests on samples of pupils across three 
grade levels # Allocations of Title I funds to the states would be based 
on relative incidence of disadvantage, defined as performance below some 
minimum level. The states would then be responsible for targeting these 
funds to eligible children within the local agencies, who in turn would 
be responsible for diagnosing individual needs, prescribing appropriate 
treatments or services, and evaluating results. 

Our policy study focuses on four aspects of Quiets alternative — 
goals, feasibility, cost, and impact: 

• The g oals analysis involves an examination of the social and 
legislative implications of the proposed redefinition of edu- 
cational disadvantage. Briefly, the consequences of this re- 
definition are viewed as profound. Quiets proposal, if adopted, 
would alter the thrust of Title I in two basic ways. First, it 
would define ''program** and **eligibility** in terms which are 
independent of social, economic, and cultural factors. This 
would represent a radical change to those who view Title I 

as social legislation designed to compensate for effects of 
adverse environment or to equalize school resources at some 
minimal level. Second, this shift in definition would con- 
stitute a change from the concept ol **equal educational op- 
portunity** to that of **equal educational performance,** albeit 
at some minimal level. Definition of this minimal performance 
standard at the national level would create a further compli- 
cation, because it calls for federal specification of educa- 
tional goals and outcomes* 

• The feasibility study explores methodological and technolog- 
ical requirements for implementing the alternative. The 
general conclusion is that the state of the art regarding 
criterion- referenced systems of measurement is not suffi- 
ciently advanced to meet the demands of ^.uch a massive ap- 
plication. There are even more complex difficulties in 
developing and utilizing a defensible performance-based 
definition of educational disadvantage. For example, to 
derive and employ a performance definition of need on an 
equitable basis, the same criteria would have to be applied 
uniformly throughout the country. However, curricula are 
known to vary substantially within and across states* 
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Therefore, it would be difficult to determine the extent 
to which the results of a national assessment reflected 
these differences in curricular emphases or true differences 
in educational attainment, 

• The cost analysis shows that even if we were convinced that 
a single performance standard was both valid and defensible, 
the task of targeting the program to individual participants 
would constitute an enormous logistics and cost item, Quie 
has suggested that a testing program which would provide the 
data needed for allocations to states could be implemented 
for approximately $5-1/2 million, and that states could 
effectively target the program to their own constituencies. 
It should be recognized, however, that if the definition 

of disadvantage is to be uniform throughout the country, then 
a standard set of measuring procedures must be applied to 
each and every potential participant. This would ultimately 
result in a census testing approach to targeting which, 
based on conservative estimates, would cost in excess of 
$1/4 billion. Since the costs for gathering targeting 
data would amount to 1/6 of the total program budget, they 
cannot be viewed as trivial, 

• The impacts analysis includes a consideration of general 
differences between current recipients of Title I services 
and those who would participate if Quiets approach should 

be adopted. Substantial changes in the distribution pattern 
of program resources appear likely, with at least some portion 
of funds being diverted away from the poorest districts and 
into those more affluent. An increased flow of resources 
into the southern states also appears probable, although the 
magnitude of any redistribution will likely be minimized by 
some form of hold -harmless** provision, A frequently ex-* 
pressed concern regarding the Quie Bill is its potential fo^ 
producing **negative incentives,** whereby programs which 
succeed in promoting pupil growth would risk loss of continued 
support, and programs producing little or no positive growth 
would be assured of continued funding. 

Quiets amendment proposes several remedies to current difficulties 
in Title I, but the implications of these remedies are profound, and 
would introduce major, costly, and long-lasting changes into the national 
program. We think the basic features of the amendment--perf ormance-based 
diagnosis of need and individually-prescribed instruction toward negotiated 



objectives — could be usefully Incorporated in local and state level pro- 
grams; but they do not offer a viable solution to the allocation and tar- 
geting problems of Title I at the national level. We therefore recommend 
against adopting performance -based eligibility criteria for federal tar- 
geting of Title I funds. 
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II THE PROGRAM 



The Elementary and Secondary Education Act of 1965 (ESEA) was signed 
into law April 11, 1965. Federal aid lor educationally deprived children 
was authorized as Title I of P.L. 81-874. In January 1968, Congress of- 
ficially redesignated the enabling legislation as Title I, ESEA— Financial 
Assistance to Local Educational Agencies for the Education of Children 
of Low-Income Families. 

The purpose of this program is "to provide financial assistance ... 
to local educational agencies serving areas with concentrations of chil- 
dren from low-income families to expand and improve their educational 
programs ... [to meetl the special educational needs of educationally de- 
prived children." (Finch, 1969, p. 2.)* The term "educationally deprived 
children" has been defined in the Title I regulations as: 

... those children who have need for special educational 
assistance in order that their level of educational attain- 
ment may be raised to that appropriate for children of their 
age. The term includes children who are handicapped or whose 
needs for such special educational assistance result from 
poverty, neglect, delinquency, or cultural or linguistic iso- 
lation from the community at large. (U.S. Office of Education, 
1969, p. 7.) 

In size. Title I represents the largest single federal program of 
categorical aid to public elementary and secondary schools. The budget 
for FY 1972 (for which the most recent figures are available) involved a 
total expenditure of nearly $1.6 billion, directed toward the needs of 
nearly 8 million pupils in 16,000 school districts throughout the United 
States and its territories. This single program amounts to nearly 30% 



* 

The references are listed after the mair t«xt of this report. 



of the total USOE budget, and nearly 70% of all USOE assistance to the 
di sadvantaged. 

Because of the need to maintain a constitutionally based (or implied) 
separation of federal and state responsibilities in the conduct and financ- 
ing of education, Title I appropriations are allocated to State Education 
Agencies (SEAs) , which in turn distribute resources to local education 
agencies (LEAs) —primarily districts, but occasionally larger or smaller 
units such as counties or schools— on the basis of relative disadvantage 
or need of their student populations. Since Title I is a categorical pro- 
gram, state level allocations are determined on a (complex) formula basis. 
Briefly, a given state's share of total revenues is determined by the num- 
ber of children within the state, aged five to seventeen, who 

• Are members of low-income families ($2,000 or less annual 
income) ; or 

• Are members of families receiving more than $2,000 annually 
in the form of AFDC assistance; or 

• Are in foster homes; or 

• Are in institutions for neglected or delinquent children 
which do not receive state education assistance. 

This total of the four groups is then multiplied by one-half the 
average state or national per-pupil expenditure, whichever is higher. 

By law and tacit agreement, states are required to observe federal 
regulations and guidelines in administering these program allocations 
(delivering program services) to appropriately targeted pupils via LEA 
Title I. But because of the enormous range and complexity of intra- 
and inter-state variability on nearly all aspects relevant to program 
administration, coupled with generally inadequate state administrative 



*National Advisory Council for the Education of Disadvantaged Children, 
p. 7, (1972). 
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allocations (e.g.^ 1%) , noncompliance with federal guidelines at the 
local level Is frequently observed with Title I projects. 

Program Criticisms 

A program as large and complex as Title I Is bound to encounter 
problems and to show some deficiencies. Recently ^ however ^ the program 
has come under considerable criticism In the following three general 
categories: 

• Allocation Formula . Eligibility or entitlement criteria are 
viewed as unrealistic or Inequitable ^ since their meaning 
varies from state to state, as well as from district to 
district within states. 

• Targeting. The program is criticized as reaching either too 
few or the wrong targets because of unreliability and obsoles- 
cence of targeting data. 

• Delivery of Services . The delivery system is considered in- 
adequate, because, when reviewing program effects^ it is 
difficult to trace expenditures below the level of the LEA, 
and almost impossible to follow them below the level of 
individual school expenditures. 

These three categories of criticisms are considered in more detail 
in the paragraphs to follow. 

Allocation Problems 

The criticism that the current formula produces inequitable dis- 
tribution of resources is well grounded. Major differences in general 
cost of living between primarily agricultural communities and urban in- 
dustrial communities are not accommodated in the current poverty com- 
ponent of the entitlement, resulting in under-allocations to states 
having high proportions of genuinely poor communities whose inflated 
economies do not meet the national poverty criteria. Largely self* 
sufficient (and heavily subsidized) farm states, on the other hand, 
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tend to receive above-average allocations on an urban-adjusted Income 
basis* The crucial problem^ of course, Is In defining an appropriate 
normalizing procedure for adjusting wlthln-state Income to a uniform 
economic base. It is doubtful that a truly normalized Income scale would 
be politically acceptable, since the current economic diversity (which 
this scale would have to accommodate) Is so large that the poverty def- 
inition would probably have to vary by several orders of magnitude 
(e.g., from $1,000 to $6,000). 

The second component of the formula, entitlement based on AFDC, 
also can be validly criticized. Since welfare administration is a state 
rather than federal function, programs and procedures are far from 
standard throughout the nation, and states which have adopted generous 
welfare-eligibility criteria can produce a more than trivial Impact on 
overall allocation patterns. The Title I allocation formula is not 
designed to adjust for such differences, and the result Is a certain 
degree of Inequity in funding which directly reflects state variation in 
welfare policy. Also, there is a tendency for certain ethnic groups to 
reject welfare assistance for primarily cultural reasons (e.g., Chlcanos) . 
These groups, in turn, do not receive their "fair share" oi the com- 
pensatory education support. 

Targeting Problems 

Targeting criticisms overlap to some extent with allocation criti- 
cisms in that both reflect inadequacies of the current entitlement 
procedures. One frequently expressed targeting criticism cites the 
obsolescence of census data (the primary source for income determina- 
tion) and the unreliability of welfare data for targeting state funds 
to LEAs. Census data are gathered only once a decade, and the pro>» 
cesslng lag produces a situation in which the recency of data ranges 
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from two to twelve years. Population growth and mobility patterns coupled 
with shifting economic characteristics of communities vary dramatically 
over this interval, resulting in moderate- to large-scale targeting 
errors on the basis of census income data. The magnitude of these errors 
can be witnessed in the recent difficulties in reallocating and retarget- 
ing, from a basis of 1970 census data. These difficulties precipitated 
current vigorous efforts in some sections to reformulate Title I (to 
preserve existing funding patterns). Another criticism of census data 
acknowledges the census undercount of certain disadvantaged groups. 

A second targeting criticism is that the services are reaching 
neither the number nor the type of children for whom Congress intended 
the program. This criticism takes two forms: (1) that the current level 
of support is inadequate to meet the need; or (2) that the current formula 
excludes many children who should be entitled to services. It is unlikely 
that any overall appropriation level will be Judged completely sufficient, 
because of the ipsative nature of need determination.* But questions do 
arise regarding whether it is preferable to concentrate resources on those 
defined as most in need, or to distribute available services across a 
broader range of disadvantage. The current policy of concentrating re- 
sources at the lowest poverty levels is criticized as serving only 
"approximately one third of the population Congress intended it to serve*' 
(National Advisory Council on the Education of Disadvantaged Children^ 
1972, p, 7), This criticism obviously involves an interpretation of con- 
gressional intent. 



In general, need is defined in relative rather than absolute terms. 
Thus there will always be a 'poor*' group, relative to the total popu- 
lation. 



1-9 



An alternative interpretation of Title I congressional intent is 
advanced by critics who argue that educational disadvantage should be 
defined in terms of academic performance instead of social/economic 
variables. These critics maintain that the current formulation targets 
services to fewer than one-half of school children who are performing 
poorly and thus are in need of educational assistance. The proposed 
alternative, that of defining disadvantage in terms of school per- 
formance, is the basis for much of this report, and is considered in 
detail in several contexts in chapters to follow. 

Delivery Problems 

Criticisms relating to delivery of Title I services point up 
further ambiguity in the intent of the program. Title I is unique 
as a federal assistance program, in that an evaluation component is 
legislatively mandated. However, it is not clear whether the intended 
primary locus of assistance is the school or the pupil. Critics who 
view the program as designed to provide financial aid to poverty-impacted 
schools argue that evaluations should be directed at delivery of such 
assistance, and not at studies of pupil growth. Conversely, those who 
see the principal Title I mission as intervention in the education of 
poor children argue that the diffusion of services at the school level 
accounts for the general failure of the program to produce measurable 
pupil change so far. And, of course, the sheer magnitude and com- 
plexity of the total program, coupled with the organizational vulnera- 
bility of the federal-state and state-local interfaces, has made effective 
and uniform administration of delivery of services a nearly impossible 
task. As a result, funds are inevitably misspent, and the frequency with 
which audit exceptions are discovered may Jeopardize the future of the 
program. 



I-IO 



An Alternative Proposal 

A number of alternative formulations of the Title I program have 
recently been proposed. Perhaps the most radical alternative is HR 5163 
presented by Congressman Albert Quie (Republican, Minnesota) , known as 
the Quie Bill. Briefly, Quie argues that performance is only moderately 
correlated with poverty; hence, many children who are performing badly 
but are not from impoverished families are excluded from the Title I 
services* His proposal would amend Title I in two ways: 

• By substituting performance criteria for social/economic 
criteria in determining eligibility* 

• By requiring that no less than 85% of program funds be spent 
on targeted children in the form of individually prescribed 
instruction toward attainment of negotiated objectives in 
the areas of reading and mathematics. 

To identify eligible children, Quie proposes that a national 
assessment be made in the areas of reading and mathematics, using 
criterion-referenced tests on samples of pupils across three grade 
levels. Allocations of Title I funds to st&tes would be based on rel*- 
ative proportions of children identified as "educationally disadvantaged" 
i*e., those failing to perform at or above criterion levels. The states 
would then be responsible for targeting funds to eligible children within 
LEAs. LEAs would presumably be responsible for diagnosing individual 
needs, prescribing treatments (or negotiating objectives), and evaluating 
results* 

On the surface, this proposal has some attractive and appealing 
features* The approach would focus remedial attention on those children 
who are performing at a substandard level, and appears to offer a tighter 
and more defensible mechanism for resolving problems in allocation, 
targeting, and delivery of services* Upon deeper analysis, however, a 
number of major issues and problems emerge, the implications of which 
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argue against adopting Quiets alternative. These issues are explored 
and discussed in the following policy analysis perspectives throughout 
the remainder of this report: 

• Goals analysis , involving an examination of the social and 
legislative implications of the redefinition of educational 
disadvantage. 

• Feasibility analysis , or a study of the technological and 
methodological demands of the alternative with respect to 
the current state of the art. 

• Cost analysis , or an estimate of the probable cost and 
logistics requirements to implement the alternative. 

• Benefit analysis , involving evaluation of probable short- 
and long-range consequences of the alternative. 
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Ill GOAI£ ANALYSIS 



Quie proposes what appears to be a modest and logical shift In the 
definition of educational disadvantage. However, the consequences of 
this shift are quite profound In that they imply relnterpretatlons of both 
the legislative Intent of Title I, ESEA, and the federal role in the 
specification of educational outcomes. 

Most interpretations of the legislative intent of Title I and other 
provisions of ESEA have been based on an assumed connection between the 
quality of educational environment (such as method and materials of in- 
struction and social and cultural environment), and the educational 
progress of students. Given an awareness of marked differences across 
and often within communities in the social and economic composition of 
local educational environments throughout the nation, Congress enacted 
the Elementary and Secondary Education Act of 1965 in an effort to 
correct the perceived inequities and to promote conditions which would 
better reflect the prevailing philosophy of commitment to providing 
"equal educational opportunity." This was important social legislation, 
since education is generally perceived as the primary instrument by which 
social anci economic mobility are attained. Moreover, to the extent that 
evidence of a relationship between educational progress and social/cultural/ 
economic subgroup membership exists, then a case can be made that equal 
educational opportunity does not exist. Thus, the Title I provision of 
ESEA 'Afas designed primarily to reduce the inequity of educational op- 
portunity on the basis of probably the most basic and direct indicator 
of such potential inequity: namely, family income. Stftted differently, 
the Title I legislation was designed primarily to eliminate the major 
educational consequences of adverse economic conditions within communities* 
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Whether or not the program Is succeeding in this goal is certainly the 
subject of much controversy and continuing Investigation. 

Another consideration Is the separation of federal and state con- 
trol over educational activities, which may make the use of performance 
criteria neither justifiable nor legal. Briefly, the federal constitution 
makes no mention of responsibility for or control over the conduct of 
public education. Many state constitutions, however, do make specific 
reference to educational activities. Thus, traditionally, American 
education has been the states' right and responsibility. In the last 
two decades, tension between the state and federal education agencies over 
the conduct and influence of educational practices has notably increased. 
States are in the complex dilemma of needing financial assistance, which 
the federal government seems capable of providing, and at the same time 
of seeking to maintain autonomy regarding educational policy and practice, 
which many federal assistance programs seek to influence (e.g., desegrega- 
Mon and ESEA) . Qule's alternative clearly establishes a new federal role 
in public education in that it requires specification of national standards 
for educational outcomes in the areas of reading and mathematics. As will 
be shown in later discussions, this is a fundamental requirement of the 
proposed program and cannot be compromised if the alternative is to be 
seriously Implemented. The problem is that a constitutional amendment 
(or at least some enabling legislation) may be required before federal 
specification of performance standards is possible. 

In short, the issue is: Can "educational performance" be substituted 
for "educational opportunity" legitimately, and within the Intent of the 
current legislation? This is not to say that the performance-standard 
approach to educational output — in terms of some minimally acceptable 
criterion— is not a very attractive proposition. Indeed, most educators, 
social and behavioral scientists and philosophers see great potential 
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benefit In this goal. The question becomes **Is Title I the proper 
vehicle for this reform?** Wo think not, but the answer to this question 
obviously depends on how one resolves the foregoing Issues of congressional 
Intent and constitutional legality. But beyond these primarily political 
and legislative concerns are the realistic questions of feasibility and 
cost. That Is, assuming a decision were made today to adopt HR 5163, 
could the alternative actually be Implemented, given our current technical 
and methodological state of the art? Further, how much would It cost; 
and, since It is likely to be more costly, how much greater overall 
educational return would the alternative be likely to produce? Each 
of these major practical Issues Is presented and discussed In the following 
sections of this report. 
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IV FEASIBILITY ANALYSIS 



Quie's proposed amendment to Title I requires more than a simple 
rethinking of the legislative intent of the Title. The methods and 
procedures by which the proposed revision would be implemented are by 
no means clearly available. Many of the components on which implementa- 
tion of Quie's amendment is predicated requiu a state of the art which 
we have not yet in general attained. The purpose of this section is to 
discuss the technical and methodological requirements of the Quie Bill 
and to consider the general feasibility of implementing this approach 
given these requirements. Two large problem areas give rise to serious 
questions regarding the feasibility of any test-based approach to targeting 
resources; these areas are definition of content domains and establish- 
ment of performance (i.e., eligibility) standards. 

Alternative Systems of Measurement 

One of the major features of HR 5163 is its reliance on criterion- 
referenced systems of measurement for developing targeting and allocation 
data, for diagnosing student need, and for assessing student progress or 
program impacts. Criterion-referenced measurement technology and more 
familiar norm-referenced procedures are generally distinguished from each 
other in terms of the purpose for which the measurement is made, rather 
than in the appearance or properties of the measurement instruments 
per se. Criterion-referenced procedures are generally designed to char- 
acterizf; the examinee with regard to the presence of, or measurable 
quantity of, some trait or skill dimension, often with reference to some 
specific criterion. Norm- or distribution-referenced procedures, on the 
other hand, are designed to differentiate individuals with regard to a 
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(possibly similar or identical) trait or skill dimension, and generally 
involve reference to the performance of a **norm sample*' for interpretation 
of scores,* Quie, in his testimony before the House ( Congressional Record , 
6 March 1973) tends to obscure this distinction when he states , ^\ . . through 
the use of a criterion-referenced test we could determine how well the 
average third grader should be able to read^ test third graders to determine 
how many fell far below the expected level of competency , and then allo- 
cate funds to be used to bring those students up to the expected level/' 
This measurement application is much more characteristic of norm-referenced 
than criterion-referenced approaches. 

At present y we distinguish among three types or applications of 
criterion-referenced measurement: objectives-based testing , mastery 
testing, and domain-referenced testing. 

The general form of this objectives-based testing strategy is as 
follows: Given some program or treatment (x) , the student will be able 
to produce the behaviors (y) , under conditions (z) , or at rate (z). The 
definitions of the parameters x, y, and z are left most often to the 
judgment and ingenuity of the teacher or evaluator. As might be expected, 
a great deal of variation (and often triviality) exists in objectives and 
associated measurement procedures under this approach. 

Mastery testing requires the formulation of sequences of contin- 
gencies — generally, but not necessarily, binary or ''pass/fail''— which 
constitute the mastery test points. It is possible to have a single 
mastery test point (e.g., a written driver's test) or multiple tests 
(e.g., a learning hierarchy with separate tests for each level). 



For a discussion of the recent history of norm-referenced measurement 
in American education, the reader is referred to Appendix A. 
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The principal features of domain-referenced tests are as follows: 

(1) An explicit definition of the domain to be measured^ 
including the skills or behaviors and the range of 
eligll^ile content. 

(2) Explicit rules for generating or constructing items 
which will be used to sample the examinee *s status 
with respect to that domain (generally referred to as 
"item forms''). 

(3) Explicit rules for interpreting performance on test 
items. 

These approaches to measurement differ from each other in several 
ways. Nevertheless, they all represent attempts to obtain directly 
interpretable performance data—that is, information which describes 
specific competencies of individual examinees.* Norm-referenced ap- 
proaches, on the other hand, do not generally yield indicators of whether 
specific skills are present or absent in a given examinee. 

Comparison of Testing Alternatives 

At this point it appears that the various achievement-testing 
alternatives could be arrayed along a single continuum with respect to 
the generality or specificity of the domain to which test performance 
is to be generalized. Objectives-based tests would have the most limited 
(i.e., specific) generalizability , whereas standardized achievement tests 
would have the least specific (i.e., the domain becomes very broad, if 
not vague). The mastery-testing and domain-referenced approaches would 
occupy intermediate positions on this dimension and would be seen to vary 
primarily in terms of the decision purposes of the testing. Consider, for 
example, a reasonably well-defined domain at a level of intermediate 



♦ 

For further descriptions of the approaches to measurement, the reader 
is referred to Appendix B, 
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generality, such as **ability to perform any addition operation/* If the 
purpose of testing were to assess whether students could attain some 
specified level of proficiency in addition operations, the testing ap- 
plication would assume the form of mastery-testing procedure, A mastery 
criterion would be established on rational or empirical grounds, and the 
test would be administered to a group of students. Performance would 
then be evaluated with respect to attainment or non-attainment of the 
criterion. 

The same test could be used in a domain-referenced context, with 
test performance evaluated in terms of the domain measured— extent of 
arithmetic ability, error rate in addition problems, or whatever. Finally, 
this same test could also be used in a norm-referenced application if, 
for example, some version of it were administered to a norming sample 
against whose performance that of our target population or sample could 
be compared. If such norming were carefully conducted, using conventional 
procedures, we would now have two possible interpretations of performance 
of our group of examinees on the test in question: 

(1) The proficiency of our examinees in terms of appropriate 
behaviors defined by the domain. 

(2) The relative performance of our examinees compared with 
that typical of the norming sample. 

An example of such a dual interpretation might be as follows: 
Using the above-defined domain of arithmetic addition problems, a teacher 
of the third grade might be able to say at the end of a semester of in- 
struction that her class averages 90 percent correct on any addition 
problem, and that this performance relative to third graders throughout 
the nation is at the 95th percentile, or that her class is performing 
at the fourth-grade level. The obvious advantage of such a multiple 
interpretation is that it allows one to Judge the adequacy or desirability 
of the outcome in terms of the domain itself, as well as relative to the 
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performance of comparable groups. This latter interpretation (the norm- 
. srenced one) must, however, take into consideration the same restric- 
tions that apply to conventional norm-referenced interpretations; i.e., 
since we do not currently utilize a national curriculum, variations in 
performance within a norming sample on any domain-referenced task will 
be influenced by variations both in curriculum and in the individuals 
who constitute the norming sample. Essentially, this consideration 
points to the vulnerability of any norm-referenced approach to problems 
of invalidity. 

These considerations suggest that a domain-referenced approach 
probably would be most appropriate for developing allocation and targeting 
data, whereas a mastery-testing approach might be required for diagnoses 
of individual needs and objectives. Appropriate instruments do not, how- 
ever, currently exist in readily-accessible forms or quantities. Thus, 
in order to implement any test-based approach to targeting resources, 
instruments would have to be developed and validated. There are two major 
problems in developing appropriate domain-referenced tests: definition 
of domains, and establishment of performance standards for defining dis- 
advantage . 

Definition of Domains 

Two questions arise regarding the definition of domains. First, 
should the definition be narrow or broad with respect to content area? 
Second, should it be representative of or independent of current educa- 
tional curricula? The trade-off regarding breadth of domain definition 
is in terms of precision vers is generalizability of estimates. For very 
narrowly defined domains, performance estimates may be precise and reliable, 
but possibly not generalizable to related aspects of content. In this 
sense, the domain-referenced test approaches the objectives-based test. 
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On the other hand, if the domain is very broadly defined, large samples 
of performance are required for reliable interpretation, although gen- 
eralization to the content area becomes more valid. The extreme case 
becomes one in which the domain is defined as ''general ability/' 

Similarly, in defining a domain, considerations of curricula are 
important. If the domain definition were (or attempted to be) inclusive 
oi all current curricula, it probably would resemble current standardized 
achievement tests. If the definition were curriculum-free, it might re- 
semble an IQ test. If the definition were restricted to "dominant** cur- 
ricula, it might be invalid for use with pupils whose educational expe- 
rience is based on quite different curricula. This is to say, we do not 
yet fully understand the relative impacts of variations in curricula on 
educational attainment. It is most likely that these curricular variations 
produce corresponding differences in performance. This means we would 
have to choose one of four options: 

(1) Prepare separate curriculum-specific dom^iin definitions. 

(2) Select one dominant (or arbitrary) curriculum for the 
domain definition. 

(3) Expand the definition to be inclusive of all curricula. 

(4) Adopt a compromise definition. 

Tlie full implications of these options are not immediately apparent; 
obviously they vary in terms of allocation functions at the local versus 
state versus national levels. It should be noted that a characteristic 
shared by most currently used domain-referenced and other criterion- 
referenced systems of measurement is that they were developed in conjunc- 
tion with specific methods and materials of instruction (e.g., Hively et 
al., 1973). For this reason they are imbedded within particular curricula. 
Any nationwide application which attempts to take into account variations 
in curriculum, therefore, appears fraught with problems, given the current 
state of the art. 
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We strongly recommend that the options suggested above and their 
consequences be carefully researched before large investments are made 
in test development and assessment programs. 

Establishment of Performance Standards 

The major questions regarding the setting of performance standards 
for defining ''disadvantage'' are: 

(1) Should the standard be distribution-referenced or based 
on some absolute (or rational) criterion? 

(2) Should a single standard be imposed, or should we adopt 
variable standards which vary according to region, local- 
ity, and curriculum? 

(3) Should the standard be oriented toward age-level or 
experience-level criteria? 

(4) Should separate standards be developed for exceptional 
children (e.g., the retarded and emotionally handicapped)? 

(5) Should the standard be in terms of a cutoff point, or 
should we provide flexibility by using an interval cri- 
terion? 

It should be apparent that answers to these questions will play an 
important role in determining both the ultimate costs and technical 
feasibility and requirements of implementing the program. For example, 
the administrative logistics for a national assessment would be con- 
siderably different in cost and 'tJomplexi ty than for 50 separate state- 
wide, or 16,000 district-wide assessments. And if multiple criteria or 
instruments are employed, the likelihood of methodological difficulties 
would increase by several orders of magnitude. 

The use of a single performance standard is strongly implied by 
HR 5163, The presumed advantages of a single standard are that it would 
provide (1) a uniform definition of educational need, (2) an equitable 
basis for allocating program funds to states (supposedly overcoming the 



1-23 



shortcomings of the multiple poverty criteria) » and (3) a good benchmark 
against which to evaluate program effect. Difficulties of this approach 
are in establishing the standard and in defending It In view of the 
enormous regional and local variation in curricula throughout t^.e country, 
Quie suggests the standard be set by determining '\ , , how well the average 
third (or sixth, or ninth, or whatever) grader should be able to read (or 
solve problems, or whatever) , . /' ( Congressional Record , 6 March 1973), as 
if procedures for such determination currently exist. In fact, short of 
some form of arbitrary judgment, they do not. The creation of a National 
Commission on Educational Disadvantage, as suggested by Quie, for the 
purpose of determining the standard and developing a test would not 
alleviate any of the above developmental problems. 
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V COST ANALYSIS 



Quie, in his testimony, reports that the cost of implementing his 
alternative has been estimated at $5.6 million. In our study of the 
probable costs, we arrive at a figure which exceeds this $5.6 million 
estimate by a factor of 50. This section presents the details of the 
assumptions and results of our analysis. 

Our analysis of issues involved In the implementation of HR 5163 
is broken into two cost categories: (l) instrument development, and 
(2) Implementation (allocation, targeting). 

Instrumentation Costs 

The previous analysis of technological and methodological issues 
pointed up the fact that we are not yet prepared to gather criterion- 
referenced data on a nationwide basis, regardless of subject matter. 
Before such an assessment could begin, considerable instrument-development 
activities would have to be undertaken to develop appropriate measurement 
systems for the content a/eas and grade levels of Interest. 

It may be possible, as Quie suggests, to obtain national estimates 
of performance using just three grade levels; but ultimately these measure- 
ment systems will have to accommodate all grade levels. Such estimates 
could be useful for allocating funds to states, but would not be suffi- 
cient for targeting purposes, unless Quie intends entitlement to be 
restricted to these three grade levels. In short, if we are to employ 
performance criteria to identify educational disadvantage across the age 
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range five to seventeen, we will need appropriate instruments to assess 
performance at each such age; thus, 12 age levels X 2 content areas - 
24 tests. 

The cost of developing criterion- referenced tests is not yet known, 
but can be estimated. The Minnemast Project (Hively, 1973) reports aver- 
age costs for the development of fairly useful domain-referenced tests at 
$50,000 per content area X level. This projects a total cost of $1.2 mil- 
lion, if such estimates apply consistently across the total age range. On 
the other hand, the National Assessment test-development costs were in 
the neighborhood of $1 million per content area, and these tests were not 
designed to be used at all levels. It thus appears that the test- 
development costs alone could exceed $2.0 million. Furthermore, since 
large-scale applications of criterion-referenced measurement technology 
have still to be developed and validated,* the total instrumentation costs 
may ultimately become many times this figure. 

Implementation Costs • 

Our analysis of probable costs of implementing HR 5163 has been 
restricted to those components which would be modified by the amendment. 
As such, these estimates are seen as probable costs over and above current 
administration and operating costs. 

Three alternative programs for implementing HR 5163 might be as 
follows : 

• Alternative A . Develop national performance estimates 
by grade level and content area; reformulate entitle- 
ment based on these data; and require states to supply 
eligibility and targeting data, using equivalent measures. 



As Madaus and Elmore (1973) point out. Quiets assertion that the National 
Assessment and certain state assessment programs (e.g., Michigan) are 
currently using criterion-referenced testing is incorrect. 
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• Alternative B > Develop national performance estimates 
based on representative within-state samples; reformulate 
and allocate resources to states based on eligibility; 
and require states to determine local need, again using 
equivalent measures. 

• Alternative C . Conduct a national assessment using a 
total census sample; reformulate entitlement criteria; 
allocate to states on the basis of eligibility ^ and 
allow states to use the national assessment data for 
purposes of distribution of program resources. 

Alternp«tives A and B involve sampling procedures at the national and 
state-within-nation levels i respectively. Alternative C involves no 
sampling- -the total United States and associated territories* population 
of school children would be tested — but it would be an enormous under- 
taking in cost and complexity. The goals in Alternatives A and B are 
to estimate the distribution parameters of the respective populations (or 
the proportion of the populations below some criterion value) . The cost 
of each procedure can be estimated on the bases of fixed and incremental 
components associated with the data-collection operation. The major cost 
factor is seen as the sample size needed to minimize sampling error: i.e.i 
to obtain estimates of the population of interest (nation^ state ^ dis- 
trict) at a given level of precision and at a given confidence level. 

If we assume a normal distribution ^ and our goal is to estimate 
the mean oX this distribution ^ we can define the following table of sample- 
size requirements (see Table 1). 

Entries in Table 1 can be read as indicating that a random sample of 
27 would provide an estimate of the mean that, 95 percent of the time, 
would be within 5 percent (SD units) of the tnje population mean. Simi- 
larly, a random sample of 29,355 is necessary to estimate the mean at 
the 99 percent confidence level within average error limits of 0.5 percent. 

If wc disregard the shape of the distribution, but wish only to esti- 
mate what proportion meets or exceeds some known value (or criterion) , a 
similar sairpling table can be constructed. 
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Table 1 



SAMPLING REQUIREMENTS FOR ESTIMATING 
THE MEAN OF THE DISTRIBUTION 



Level of Precision 
5% 2% 1% 0.5% 



Level of 


.95 


27 


1,537 


6,157 


17,074 


Confidence 


.99 


46 


2,642 


10,568 


29,355 



For example I Title I currently reaches 15 percent of the population. 
Assuming this level of effort would be retained, the goal becomes that of 
estimating the lowest 15 percent of the population on performance terms. 
The sampling requirements for this estimation problem are as shown in 
Table 2. 

Obviously, these requirements are not g eatly disparate from those 
needed to estimate the mean under assumptions of normality. In either 
case, the consideration of the precision of sample estimates is cle rly 
the main factor in determining the costs of the operation. Two additional 
observations suggest that these costs will be relatively large. 



Table 2 

SAMPLING REQUIREMENTS FOR ESTIMATING 
THE LOWEST 15 PERCENT OF THE DISTRIBUTION 



Level of Precision 







5% 


2% 


1% 


0.5% 


Level of 


.95 


196 


1,224 


4,898 


19,592 


Confidence 


.99 


339 


2,122 


8,487 


33,948 
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(1) Campling errors compound measurement errors. To 
the extent that the tests are unreliable ^ any errors 
due to small samples will further reduce the valid- 
ity of resultant estimates. Thus, sampling errors 
which we have under our control must be minimized. 

(2) Allocation errors, defined as the proportion of 
resources allocated versus the true proportional 
need, will be a direct consequence of sampling X 
measurement errors • 

The importance of minimizing sampling error can be appreciated when 
it is seen that, assuming total program resources to be allocated are 
$1.5 billion, a one percent erro r in estimating needs will result in 
$2 million average allocation error's at the state levels , or $100 million 
overall . 

In developing cost estimates for either of the sampling options 
(national, or state within nation), the following assumptions were 
employed : 

• Testing would occur ac ross three grade levels (as proposed 
by Quie) . 

• Test development represents a fixed cost of approximately 
$100,000 per grade level per content area, or a total of 
$600,000. 

• A standard and uniform test-administration procedure would 
be employed throughout, using independent testing staff 
(not teachers) under high quality assurance requirements. 

• Test-administration and processing costs would be incremental, 
but economy of scale would diminish, reaching an asymptote 

at $5 per pupil under large-scale conditions. Unit cost 
estimates, based on a log scale of declining unit cost as 
a function of sample size, are as follows: 
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Unit Cost 
Sample Size (dollars) 



100 $67 

1 ,000 45 

10,000 30 

100,000 15 

1,000,000 7 



Given these working assumptions* and the previous sample requirement 
table, the estimated cost of generating national performance data 
(Program A) can be determined (see Table 3). 

Table 3 shows that the probable costs of estimating national need 
(or the reformulation of Title I to performance criteria on a national 
basis) will be somewhere between 3/4 and 1-1/2 million dollars, if we 



Table 3 

ESTIMATED COSTS 
OF GENERATING NATIONAL PERFORMANCE DATA 
(Thousands of Dollars) 



Level of Precision 





5% 


2% 


1% 


0.5% 


.95 

Level of 


658.1 


834.4 


1,111.0 


1,418.3 


Confidence 

• 99 


663.8 


887.8 


1,284.1 


1,530.6 



*It should be noted that these cost estimates compare very favorably 
(within 5 percent) to those independently developed for Representative 
Quie. In fact, the only notable difference between estimates supplied 
to Quie and those reported here is in our inclusion of an economy-of- 
scale cost reduction. 
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use just three grade levels. If we sampled all 12 grade levels (but ex- 
cluded kindergarten) the costs would increase fourfold. 

However, Program A does not provide a basis for action. States would 
have to utilize equivalent assessment procedures to develop estimates of 
their respective needs , and use these as a basis for allocation. Because 
of this duplication in testing, Program B appears to be a more sensible 
approach. The estimated costs of implementing Program B (conducting uni- 
form performance assessments on random samples of students across three 
grade levels within each state) are displayed in Table 4 below: 

Table 4 

ESTIMATED COSTS OF CONDUCTING UNIFORM ASSESSMENTS 
AT THREE GRADE LEVELS WITHIN EACH STATE 
(Thousands of Dollars) 



Level of Precision 
5% 2% 1% 0.5% 



Level of 


.95 


812.0 


6,413.8 


9,879.5 


18,577.7 


Confidence 


.99 


891.5 








6,594.5 


13,331.6 


27,069.5 



These figures, which bear more directly on Quie's estimate of $5.6 mil- 
lion, show the cost of developing state allocation data. Quie's figure 
appears to allow an average state allocation error of more than 2 percent, 
or $4 million. Although for some states this might be acceptable, for 
large states (e.g., California and New York), such an error could easily 
exceed $40 million in program funds. But perhaps the greatest single 
deficiency of this sampling approach is that it does not provide a basis 
for targeting funds to districts , schools, and pupils. 



Indeed, the two major components of HR 5163 are that federal assis- 
tance be targeted to individual pupils (in the form of individualized 
programs) on the basis of need defined in terms of individual performance 
compared against a national standard. These two components — the national 
standard and the individual locus — ultimately require that £ single and 
uniform assessment procedure be applied to every potentially eligible 
student . In short , a national performance census is required. 

Quie*s assumption that state assessment programs can be used to 
develop this individual level data is unrealistic and unworkable. Very 
few states perform across-the-board assessments , and those that do use 
broad-range tests and variable procedures. To allow variability in as- 
sessment procedures is to introduce ambiguity into the definition of dis- 
advantage. 

The only alternative, that of a single census assessment, will be 
very costly. If we assume the per-pupil testing costs will average $5, 
then the total test cost will approach $250 million , or 1/6 of the total 
program budget. It may be possible to provide cost-sharing incentives 
to states, but the conclusion is inescapable; HR 5163 will be enormously 
expensive to implement properly. 

As a final note on costs likely to be involved in implementing HR 5163, 
we need only briefly consider the diagnostic and individualization require- 
ments. As proposed by Quie, this feature would involve the assessment of 
individual capabilities, current strengths and weaknesses, and the nego- 
tiation of a set of specific objectives, the attainment of which would be 
accomplished through individually prescribed methods. These are most 
certainly desirable features. But diagnosis of individual need at the 
level implied by Quie*s testimony is a major psychometric undertaking. 
The average cost would probably be in the neighborhood of $50 per pupil. 
The negotiation of objectives involving student i teacher, and parent is 
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complex f and may require a considerable amount of teacher time — possibly 
more than is available in many schools where resources are currently over- 
extended. The use of specialists is not practical, since it is an addi- 
tional cost and not in the spirit of the negotiation. The most costly 
component, however, is the requirement for individualization. It is not 
clear how such individualization could be accommodated in many current 
school systems, but it is clear that resources would be expensive, i^'ollow 
Through projects which employ individualized methods show annual per-pupil 
costs of $800 and more in excess of average instructional expense. Since 
Title I shows an average per-pupil entitlement of $200, the program would 
have to be radically reformulated in scope or appropriations to provide 
for such educational services. In fact, we estimate that, under the current 
Title I appropriations, Quie*s proposed individualized approach could be 
delivered adequately to less than 25 percent of the current Title I en- 
rollment • 

In summary, our analysis of cost factors associated with HR 5163 shows 
it to be an enormously expensive alternative to the current program. The 
required measurement devices and procedures would have to be developed and 
validated at a substantial cost. Also, the gathering of entitlement data 
could easily cost up to 1/6 of the total program appropriation. Finally, 
the diagnostic and individualization features would also add substantially 
to the cost of program services, and would probably require reductions in 
overall entitlement. Since our analyses are based on conservative esti- 
mates of component costs, we strongly recommend that thorough and detailed 
cost and feasibility studies be conducted before adopting HR 5163. 
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VI BENEFITS ANALYSIS 



Our discussion of HR 5163 in the context of a benefits analysis 
focuses on four issues. These are: 

(1) A consideration of general differences between current 
program recipients and those who would be served under 
Quie's proposed revision. 

(2) Some consideration of probable redistribution of program 
allocations and the concomitant social and political con- 
sequences. 

(3) A brief consideration of a potential negative incentive 
component of the program. 

(4) Evaluation of overall Title I improvements due to HR 5163. 

Participant Characteristics 

The principal argument on which HR 5163 rests is that as the Title I 
program currently operates , a large segment of pupils performing poorly 
are excluded from participation in the program because they live in school 
districts (or other administrative units) which do not meet the eligibility 
criteria. The proposed redefinition of eligibility and program mechanisms 
would presumably overcome this condition, since program participation 
would become categorical in terms of student performance as opposed to 
social/economic conditions in districts or communities. 

To examine the implications of this proposed redefinition of eligibil- 
ity in terms of the program recipients and potential program results, it 
is convenient to describe three partially overlapping population sectors. 
These are: 
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• Sector 1 > Pupils eligible for program participation on the 
basis of social/economic criteria. 

• Sector 2 . Pupils showing low academic achievement (or scoring 
below some standard). 

• Sector 3 > Pupils whose observed performance is below their 
potential, regardless of absolute level of performance (i.e., 
underperformers) . 

These three sectors of the elementary and secondary school population 
are diagrammed in Figure !• 



U: ELEMENTARY AND SECONDARY POPULATION 



SECTOR 1 




SECTOR 2 



SECTOR 3 



FIGURE 1 SET DIAGRAM OF POPULATION SECTORS INVOLVED IN TITLE I 
ELIGIBILITY ISSUES 

Several important features of these population sectors are represented in 
this figure: 

(1) Adjacent sectors are shown as overlapping, indicating some 
degree of intercorrelation. 
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(2) The relative overlap of Sectors 1 and 2 with Sector 3 
indicates the upper bound of potential program effect. 
This is because underperf ormance is defined in terms of 
actual versus potential achievement. Logically speaking, 
when a pupil's actual performance level (or rate of growth) 
approaches his potential, he cannot benefit from any such 
program. Because ''potential" is assumed fixed within the 
individual, he is already progressing at a rate consistent 
with his "level of ability." 

(3) The sector defined as "underperformance"* (Sector 3) is in- 
dependent of actual achievement level (high or low). For 
purposes of discussion, pupils who fall within this sector 
are those whose performance does not measure up to their 
ability. 

(4) The overlap of Sectors 1 and 3 is intended to show the rela- 
tive effect of socioeconomic factors on underachievement • 
Thus, only some portion of socially or economically disad- 
vantaged pupils will tend to underperform. The actual over- 
lap could be inferred from a correlation of the two condi- 
tions through use of the coefficient of determination 
(i.e., r2), 

(5) The relationship between low achievement and underperfor- 
mance is similarly portrayed. This means that not all low 
achievers can be improved. Some are doubtless performing 
at their true ability level. Again, the proportion of low 
achievers who are underperformers (i.e., can be improved) 
could be estimated from knowledge of the correlation between 
these two conditions. 

(6) The two "disadvantage" conditions (Sectors 1 and 2) are not 
totally independent, and are diagrammed to show some overlap; 
but they are not perfectly correlated. 

We believe this is a fairly reasonable representation of the concep- 
tual aspects of the eligibility problem. If the intercorrelations among 
these three sectors, as well as the number of pupils within each sector. 



The term "underperf ormance" is chosen instead of "underachievement" be- 
cause the latter term carries many surplus connotations regarding the 
interpretation of achievement. 
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were currently known, it would be possible to determine which eligibility 
criterion — poverty or poor performance — is, in fact, a more accurate proxy 
for underperformance. 

Given the substantial costs associated with collecting performance 
data on a nationwide scale for use in targeting, it is apparent that over- 
lap between Sectors 2 and 3 must be significantly larger than that between 
Sectors 1 and 3, in order for the performance-based targeting approach to 
be preferable to the current approach from a cost-effectiveness viewpoint. 

Even in the absence of empirical data regarding these parameters, some 
reasonable estimates can be made. For example, it is likely that the sizes 
of Sectors 1 and 2 will be nearly equal, no matter what. This is because 
there are finite resources which are allocated on a per-pupil basis. Thus, 
category definitions tend to be modified to match the resources, rather 
than the reverse. The question then becomes "Which sector of pupils should 
the program be designed to assist?*' The answer to this question must in- 
clude consideration of legislative intent, for its answer ultimately deter- 
mines the policy of the program. 

Redistribution Consequences 

Since we already know that poverty and low performance are only 
partly correlated, it follows that substantial changes in the flow or dis- 
tribution pattern of program resources would accompany Quie's proposed 
change in the definition of eligibility. It also follows that any such 
change will have to reduce the amount of funds available to the poorest 
districts, and to some extent, increase the funds available to more af- 
fluent districts. A general question then can be raised regarding the 
likely political and social consequences of this redefinition. Clearly, 
suburban areas which are virtually excluded from Title I participation at 
present will experience some support under the revised definition. A 
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question might be raised whether a primary source of this new support 
would be a shifting of resources from central city urban school districts , 
from rural nonindustrial districts , or from both. A second level of con- 
cern might regard the general pattern of redistribution or reallocation 
at the state level. For example, it has been known for some time that on 
the basis of draft induction examinations^ states throughout the nation 
order themselves in a quite unmistakable pattern^ with Mid-West farm states 
being at the upper end and Southern states occupying the low end of the 
distributions. If this pattern repeats itself with pupil-performance data, 
then some flow of resources from non-Southern to Southern states would 
occur. Anticipated redistributions of Title I funds have resulted in the 
emergence of what are called **hold harmless** provisions. These provisions 
essentially guarantee that at least some proportion of current allocations 
will remain in situ, regardless of changes in eligibility brought about 
either by population or economic shifts, or by reformulation of entitle- 
ment • The details of probable redistributions and some discussion of their 
likely consequences are presented in a companion paper to this. 

Negative Incentive 

As mentioned earlier in this p^per, concerns have been raised regard- 
ing the potential of the Quie Bill • r producing a general negative incen- 
tive. Briefly, according to this perspective, programs which succeed in 
promoting positive pupil growth (so that pupils perform at or above the 
eligibility criterion), would risk loss of future funds, whereas those pro- 
grams least successful in stimulating growth would be virtually assured 
of continuing allocations at or above their current levels* Although the 
problem of potential negative incentive applies to most programs of social 
assistance, its potency is hard to estimate in this instance. One possible 
mechanism for resolving this dilemma would be to provide for the continued 
support of those approaches which most effectively removed evidence of 
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disadvantage and to require the installing oi new approaches where evidence 
ot improvement failed to occur. 

Overall Evaluation 

The evaluation of the general approach embodied within HR 5163 can 
be viewed as a study of the feasibility of conducting effective large-scale 
social intervention programs at the federal level. Given the current de-* 
gree of state and local educational autonomy and the enormous concomitant 
variability in administration , governance, curriculum, and so forth, the 
following question becomes important: To what extent can or should a 
federally administered program uniformly influence state and local educa- 
tional policies, practices, and outcomes? A subsidiary question becomes: 
To what extent are major provisions and guidelines of such a federal pro- 
gram being uniformly interpreted and followed by state and local agencies? 
One factor which deserves special consideration is the extent to which 
the use of performance criteria will result in any appreciable improvements 
in program administration and results over the current criteria. As sug- 
M;ested in previous discussions, such demonstrated superiority would be 
necessary to offset the additional costs of obtaining performance data. 
Rut when we realize that the variance in appropriateness at the local level 
ul a national performance standard probably greatly exceeds that of a na- 
tional economic standard, such improved administrative effectiveness seems 
nt best questionable. 
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VII CONCLUSIONS 



Based on our study of ESEA Title I and the proposed alternative, 
HR 5163, the following conclusions may be advanced. 

• Title I, as currently administered, reveals a number of de- 
ficiencies and problems in the equitable allocation of re- 
sources, in the targeting of recipients, and in the delivery 
of program services. Current allocation and entitlement pro- 
cedures are, with much Justification, criticized as somewhat 
invalid (the relationship between poverty and educational 
need is only moderate); unreliable (census data become obso- 
lete too soon; welfare data are too variable); and often 
inequitable. Also, because services are targeted to LEAs 
instead of individual pupils, program resources tend to be 
diffused at the LEA level, and many children who show need 
do not receive program benefits because they are enrolled 

in schools which do not meet current eligibility criteria. 

• Quiets amendment (HR 5163) is a bold attempt to remedy these 
program inadequacies both by defining educational disadvan- 
tage in test-performance terms and by focusing program services 
at the level of the individual student. 

However, Quiets solution constitutes a fundamental reformula- 
tion of ESEAy Title I, and contains a number of major, complex 
and costly implications. For example^ it would: 

- Constitute a change in Title I*s legislating philosophy 
from equal educational opportunity to equal educational 
performance. 

- Require the specification of national-level standards for 
educational attainment, and perhaps impose national cur- 
ricula . 

- Require the development and utilization of uniform criterion- 
referenced test procedures on a massive and enormously ex- 
pensive ($250 million) national scale. 

- Probably cause major redistribution of resources because of 
varying SES, community and state characteristics. 
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- Be dependent on the diagnosis, prescription and administra- 
tion of costly individually planned instructional programs, 
and would be expected to increase per-pupil program expendi- 
tures substantially. 

- Provide negative incentive which might act against effective 
approaches i 

Our conclusion is that the basic features of HR 5163 represent 
desirable goals and procedures which could be incorporated 
within educational programs designed at the level of LEA or 
SEA. They do not, in our opinion, represent acceptable solu- 
tions for Title I targeting or allocation problems, and we 
recommend that this amendment not be adopted. 
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Appendix A 

NORM-REFERENCED MEASUREMENT IN AMERICAN EDUCATION 

Over the last decade a movement advocating a major reform In educa** 
tlonal measurement and evaluation practices has been gaining In both mo*» 
mentum and support. The roots of this movement stem from a relatively 
widespread dissatisfaction with standardized testing programs and their 
attendant limitations and restrictions. By far the most common applica- 
tion of standardized testing programs Is the assessment of student prog*- 
ress In general Intellectual development — namely scholastic achievement — 
at selected points throughout the student's academic career. Standardized 
testing is big business In American education today. A typical student 
may expect to be assessed by a standardized achievement battery six times 
throughout his academic career, and certainly more often If he proceeds 
to higher education. In all, current estimates place the number of stan- 
dardized tests administered within a given year at somewhere between 150 
and 250 million (Mason, 1973). 

In order to appreciate fully the extent to which standardized 
achievement testing Is entrenched In the current American educational 
system, It Is necessary to review Its history briefly. According to 
most experts, the practice of utilizing a single comprehensive Index of 
educational progress— namely , an achievement test score, Interpreted In 
an age-placement, grade-placement, or distribution-referenced (per- 
centile, stanlne, CEEB, T, or other standardized) scale — Is a relatively 
recent phenomenon. Its origin Is traceable to the ability-testing move- 
ment which was a byproduct of the standardized ability-screening 
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procedures (the army alpha tests) developed during World War I. What 
these army alpha tests showed was that individual differences In complex 
traits such as general intelligence could be reasonably well measured 
by group-administered paper and pencil tests. It was a relatively short 
period of time before these methodological developments were incorporated 
into educational testing prpctices, and with the appearance of the 
Stanford Achievement Test in 1923, the era of standardized achievement 
testing was launched. 

It is not hard to understand why standardized testing was so readily 
adopted within the American educational system. It is relatively com- 
patible with contemporary American educational philosophy , which supports 
uniform comprehensive instruction for all pupils. Since achievement 
tests are designed toward breadth rather than specificity in content , 
they can be used to provide useful general indexes of overall educational 
progress throughout the nation , under the assumption that regional or 
local variations in curricular emphasis will balance out. The principle 
of equal opportunity for all constituents , which guides the American 
educational system, virtually guarantees more or less uniform educational 
experiences to all consumers in the form of elementary and secondary 
schooling. In this sense, the American educational system is perhaps 
unique in the world. Educational systems in other cultures and societies 
frequently channel individuals into differential educational programs on 
the basis of relatively well-defined criteria such as ability, class, and 
need. Since such channeling has determining consequences regarding 



Ironically, these screening and classification tests, which supplied 
the technology for the subsequent proliferation of aptitude and achieve- 
ment testing, more closely approximated criterion-referenced tests than 
norm-referenced tests in their purpose and use. 
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subsequent occupational and social status, a great deal of emphasis is 
placed on the decision-making criteria at the level of the individual. 
Performance tests generally play a significant role in the decision- 
making process. An example is the set of comprehensive tests known as 
the "eleven plus'* examinations administered to English pupils. The type 
of secondary school a pupil will attend is highly dependent on his per- 
formance on these tests (along with his socioeconomic background). 

In the United States, however, no such formal career/occupational 
branch points can be found. The educational institutions up to and in- 
cluding college are available to every member of the society, and most 
educational decisions which help to define the range of subsequently 
available careers are ostensibly left to the individual. With respect 
to the individual student, standardized tests may serve to guide decisions, 
but they are generally interpreted in conjunction with other measures and 
predictors of future performance. For example, when university officials 
must select incoming students from among a group of applicants, they 
generally take into consideration not only scores obtained on standardized 
entrance examinations but also high school transcripts and other indica- 
tions of scholastic wnd extracurricular background. Achievement testing 
then functions primarily as a feedback device at the most general level — 
describing how the system as a whole is operating in the production of 
intellectual aad academic growth. 

Development of Stg ndardized Tests and Norms 

Nearly all examples of educational and psychological measuring; in- 
struments that have emerged in the last half century can be classified 
as norm-referenced. In the field of psychome tries, how a measuring in- 
strument is classified depends primarily on how the instrument was 
initially developed and how it is used in the measuring situation. As 



1-49 



will be shown later, it is virtually impossible to distinguish a norm- 
referenced test from a criterion-referenced or domain-referenced test on 
the basis of mere inspection of the instrument* 

The fundamental theorem underlying all distribution-referenced 
measurement* is that on any well-defined trait or ability individuals 
will differ or distribute themselves along some continuum of knowledge 
or skill attainment. The goal of measurement, therefore, becomes not so 
much the task of assessing the magnitude of a given trait within an in- 
dividual, but rather that of characterizing or distributing individuals 
along this assumed underlying continuum. With this goal in mind, two 
considerations are foremost in the development of such a distribution- 
referenced test. The first is to obtain a valid set of indicators for 
this assumed underlying continuum. The second is that of obtaining 
maximal discrimination of individuals along this continuum. Hence, the 
utility of a given measure depends on the extent to which it reliably 
differentiates or "orders'* individuals along some psychological or be- 
havioral continuum and the extent to which performance on the test can 
be argued or demonstrated as validly representing this continuum. For 
the majority of contemporary educational measures , the validity of the 
test is assured through consultation with subject matter experts and 
development of items based on commonly used curricula. The reliability 
of these measures—that is, the ability of these measures to order 
examinees consistently — is established through item analysis procedures. 
An ideal distribution-referenced test, then, is one which (1) consists 
of a representative sample of items relevant to the domain in question, 



*The labels 'norm-referenced*' and "distribution-referenced" can be used 
interchangeably in describing conventional educational measuring 
devices. 



1-50 



and (2) has the property that responses to items are only moderately 
intercorrelated (optimal r = .5) and display moderate difficulty in terms 
of probability of pass (optimal level = .5). Under such conditions, the 
test displays maximum spread of examinees (reliability) and can be argued 
as a valid mechanism for distributing individuals on a trait or dimension 
of interest. 

Since at best this approach to measurement will only yield a rank 
ordering of individuals, the use of norm groupings is introduced to pro- 
vide a basis for interpreting performance on these measures. A norm 
group consists of any well-defined population of examinees which has 
been distributed on the instrument in question and whose performance on 
this instrument can be used as a reference for interpretation of new 
data. It is very important to recognize that a norm is not in any way 
equivalent to a standard; rather, a norm is the performance distribution 
on the measuring instrument of a selected sample of examinees, collected 
at a certain time and under certain testing conditions. To the extent 
that these prior conditions— the representativeness of the sample, the 
testing conditions and the time of testing— are relevant or can be 
extrapolated to recurrent testing conditions, the norm will be useful as 
a basis for interpreting subsequent distributions of scores on the in- 
strument. Thus it is possible to generate virtually an infinite number 
of norms for any given test. As a practical matter, however, achievement 
battery norms supplied for conventional standardized achievement tests 
are obtained on more or less (quite often less) representative samples 
of pupil populations sampled across the nation at various grade-level 
strata. Also, the content of conventional standardized achievement tests 
is generally sampled from common curricula contained in instructional 
materials used at the time of test development and supplied by subject- 
matter experts. Thus, the ideal achievement test would represent a 
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cross-section of all curricula within a given content area such as 
mathematics or reading; and the test would be standardized on a nationally 
representative cross-section of pupils. 

Distributional normative Interpretations are obtained by comparing 
the performance of a selected sample to the performance of the norm 
sample at the same age and grade levels. Age- or grade-placement Inter- 
pretations represent the typical (mean) performance of different strata 
of the standardization sample on the test. For example, a norm-referenced 
interpretation of performance at the third-grade level on a reading test 
would likely mean the average score of third graders on this instrument 
at some point during their school year. To say a first grader can 
score at the third-grade level is not to say that he is doing third- 
grade work, but rather that his score equals the average of third graders 
on this test. 

Limitations and Criticisms 

As should be evident from the preceding considerations, the principal 
vulnerabilities of this distribution-referenced me»^suring approach are 
found in (1) the validity of the test content to the testing problem at 
hand, and (2) the representativeness of the norming sample in the develop- 
ment of standardization data. The most frequent criticisms one encounters 
regarding distribution-referenced tests are (1) that the test is composed 
of items sampling content which is not part of the curriculum repertoire 
of the examinees, and (2) that population or distinct subgroup differ- 
ences exist between the norm sample and the subgroup sample* 

These criticisms are generally aimed at particular instruments. 
Arguments which have been advanced against the distribution-referenced 
approach in general will be discussed below. 
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Both of these criticisms have potentially profound implications 
regarding the interpretation of performance on such a test. The first 
criticism, that of content or construct validity, is plausible since we 
do not at this time maintain a national curriculum. Educational autonomy 
is highly valued and deeply entrenched in the concept of the neighborhood 
school. Hence, national-level measuring approaches can only sample from 
a fairly wide spectrum of local curricula, with a consequent reduction in 
the interpretive validity of any performance on such nationally developed 
tests. It may be argued, however, that achievement tests probably do 
validly assess intellectual development on a relatively general level. 
That is, to the extent that curricula within an individual district or 
school are promoting Improvement In performance along relatively general 
educational dimensions or learning dimensions, the performance pf the 
test sample on an instrument should validly reflect their position along 
this dimension. But since items which make up standardized tests cut 
across many domains, and since the tests are designed to provide maximal 
distribution of examinees, the precision of estimation of performance of 
any individual on such a test is very low. The stability of a group of 
individuals on such a test is relatively high, since individual varia- 
tions tend to balance out within the group. As such, standardized 
achievement tests have relatively low utility for individual diagnosis 
and assessment of Individual progress. 

The implication of the second criticism, that of the bias or non- 
representativeness of the norming sample, is more difficult to assess. 
What we do know regarding procedures, however, suggests that this may 
be the more serious category of problems. Publishers generally attempt 
to gather normative data on a reasonably representative cross-section 
of the population. However, normative data are generally gathered on 
a strictly voluntary basis, resulting in a compliance or participation 
rate of approximately 60% of the original norming samples. If 
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participation In normlng studies Is differential as a function of any 
meaningful population or ability dimensions, then serious biases or 
Invalidities exist In the resultant norm data. An example of these 
potential biases Is the possibility that teachers might choose to report 
only data from their superior students. This would naturally displace 
the distribution of scores upward, so that the mean of the norm sample 
would be quite different than the mean of the more representative natlona 
sample. Another example might be biases due to regional differences In 
curricula that were not sampled in the norm group. Such biases could 
easily restrict the range of the norm distribution. Further complica- 
tions could occur with respect to cultural subgroups not adequately 
represented In national norms. Moreover, any systematic performance 
differences that are Influenced by cultural factors are clear sources of 
Invalidity In the measure Itself, This potential bias has been long 
recognized, and attempts to deal with It have emerged In the form of 
culture-fair or culture-free measures. However, as has been noted, most 
of these efforts yield tests that more closely approximate IQ than 
achievement dimensions. 

Finally, some questions relate to the utility of subgroup com- 
parisons on distribution-referenced tests. As indicated previously, 
distribution-referenced tests are not designed to measure the amount 
learned or achieved as such, but only how the examinees differ as to 
abilities or behaviors. When these kinds of measures are used to assess 
the performance of a particular group, judgments can only be made w:.th 
respect to the performance of some norm group. Thus, the Interpretation 
is always relative. The norm group may be at or below some desirable 
level, but this fact would never be detectable from a norm-referenced 
measure. The country as a whole may be producing poor readers, A norm- 
referenced test would only tell us whether and to what extent our 
examinees were poorer or better than some normlng group, 
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CATEGORIES OF CRITERION -REFERENCED MEASUREMENT 

Objectives-Based Testing Is generally thought of as a byproduct of 
the programmed Instruction trend of the early 1960s. With the advent of 
the technology for programmed Instruction, the need for clearly defined 
branch or terminal points In Instructional sequences became apparent. 
At the same time, renewed emphasis was placed on the development of 
operational language for specifying educational goals In the form of 
behavioral objectives. The rationale appears to be as follows: To the 
extent that educators can clearly specify In behavioral or equivalent 
terms those short-range educational goals which their curricula comprise, 
they will be better equipped to monitor both the progress of their 
students and their own effectiveness In attaining long-range educational 
goals. Measuring attainment of such objectives becomes a matter of 

(1) Defining the criterion or terminal behaviors relevant to 
the objectives. 

(2) Defining the measurement conditions under which assessment 
of the objectives would be made. 

Th(/ biggest shortcoming of this objectives-based approach appears 
to be the possibility that objectives will be measured qua objectives — 
that the measurement process will not be carried beyond the specific 
short-range objectives whose relevance to long»range goals is not neces- 
sarily established (Baker, 1973). 

Mastery Testing represents an attempt to overcome the principal 
shortcomings of objectives-based testing by linking the measurement 
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procedure (test content and interpretive criteria) to relatively well- 
formulated models or theories of learning and instruction. 

Most often mastery testing is developed and implemented within 
some decision-theoretic framework where test decisions are individualized 
and where the measurement goal is that of systematically ovaluat;ng student 
progress toward attainment of explicit educational objectives (Emrick, 
1971), Substantial controversy still rages regarding whether mastery 
should be considered a binary (on-off) or continuous phenomenon. At a 
more applied level, the controversy translates to the following questions: 

• How to define the performance level required to exhibit mastery. 

• How to estimate measurement error and reliability on mastery 
tests. 

• How to validate masi y criteria, 

• Whether different criteria ought to exist for different 
examinees* 

Until the issues raised above can be sufficiently resolved, mastery 
testing will probably not be a practical alternative to more conventional 
teacher-made tests (similar to the objectives-based concept). Further- 
more, the mastery -testing approach applied to group-lesting situations 
(e.g., classroom testing) would result in interpretations such as **pro- 
portion (or number) of students reaching mastery," instead of ''average 
level of attainment.*' 

Domain-Referenced Testing is designed to assess the magnitude or 
degree of attainment of some well-defined skill or psychological domain 
by the examinee. Although items on distribution-referenced tests can 
generally be grouped into domains, the norm-referencing procedure anchors 
the interpretation of test scores to the performance of the norm group, 
and hence does not allow judgments regarding differential attainment 
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of the domain in question in an absolute sense. With domain-referenced 
testing, performance of the examinee can be directly generalized to the 
educational or psychological domain being measured, and will produce 
an interpretation in the form of '^amount known/' Unlike mastery testing, 
domain-referenced measurement does not require the use of standards or 
cutoff scores, although such criteria may be used. In a sense, norm- 
referenced and domain-referenced testing procedures are complementary, 
and it is conceivable that a single test could be used in both applica- 
tions. 

Construction and validation of a domain-referenced test are fairly 
imposing tasks. However, this approach offers several advantages over 
other measurement methods. Baker (1973) argues that rules for determining 
the range of content for a domain specify the characteristics common to 
eligible examples of content, and thus help to focus test construction 
on generalizable skills. For this reason, domains are less subject to 
triviality than are objectives, which may be based on only a single 
example of content. 

It would appear that some domains lend themselves to this testing 
approach more readily than others. For example, Hively et al. (1968) 
have shown that mathematics is a domain particularly amenable to this 
testing methodology. More recent developments have shown that reading 
is also amenable to domain-referencing, although relevant domains are not 
as precisely definable as those in mathematics. Recent work in the area 
of affective growth and development (Duncan, 1971) has yielded some 
promising results regarding the definitions of domains. 



1-59 



SRI Protect 2158 July 1974 



Educational Policy Research Center 
JAMES W GUTHRIE ANNE S. ►=RENTZ, and RITA M. MIZE 



Research Report 
EPRC 2158-5 



THE USE OF PERFORMANCE CRITERIA 
TO ALLOCATE COMPENSATORY 
EDUCATION FUNDS 

Fiscal Issues: Resource Redistribution 
Consequences of the Use 
of Performance Criteria 



Prepared for 

OFFICE OF THE ASSISTANT SECRETARY 
FOR EDUCATION 

DEPARTMENT OF HEALTH, EDUCATION 
AtjD '.VELFARE 
WASHINGTON, D.C. 20202 



ERIC 



ACKNOWLEDGMENTS 



We wish to express our sincere appreciation to the Educational 
Testing Service for permitting us to use its Anchor Test Survey tables. 
In addition, Dr. John C. Bianchini and Dr. Carol Vale of the ETS staff 
provided enormous assistance in interpreting standardized tests and 
making necessary adjustments in our interstate analysis. Hr. Bianchini 
was the Director of the ETS Anchor Test Study upon which we relied heavily. 
Substantial additional testing expertise was provided by Dr. William E. 
Coffman, Director, Iowa Testing Programs. 

Dr. Alex I. Law, Chief, Office of Program Evaluation, California 
State Department of Education, is in a sense responsible for making the 
project possible since it was he who secured the cooperation of the Chief 
State School Officers. Without CSSO endorsement we would not have been 
able to obtain the data needed for this analysis. 

Assistance was also provided by individuals in state education 
departments in each of the states included in our analyses. In California 
we were aided by Mrs. Virginia Miner of the Financial Accounting Office 
of the State Department of Education, who explained in detail the manner 
in which Title I allocations were made. Mr. Robert Proctor provided us 
with the needed test score information on California's school districts 
as well as making timely suggestions regarding the structure of the analysis 
The California State Education Department testing division, directed by 
Dr. Dale Carlson, assisted in interpreting and comparing these scores. 
In the same office, Mrs. Birdie Boyles provided information on several 
occasions. 



Il-iii 



In New Mexico we were aided by individuals in several offices. Dr. 
Gilbert Martinez, Director of the Title I Office, Dr. Alan D. Morgan, 
State Director, Evaluation, and his assistant Mrs. Jean Padilla, Assess- 
ment and Testing, B^^ve generously of their time in explaining the New 
Mexico program. The mechanics of providing data were accomplished by 
Mr. Joseph Garcia at the University of New Mexico. 

Delaware data came to us through the goodwill of Dr. William I. 
Corkle, Supervisor, ESEA Title I, and Mr. Robert Bigelow of the Division 
of Research, both acting at the instruction of Dr. Kenneth C. Madden, 
Superintendent, Delaware State Department of Public Instruction. 

We are indebted to Dr. Lorne H. Woollatt, Associate Commissioner 
for Research and Evaluation, New York State Education Department, for 
his comments and data regarding New York's county analysis comparing 
poverty and test score criteria. 

In addition to Dr. William C. Coffman of the Iowa Testing Programs, 
several individuals in the Iowa State Department of Education provided 
critical assistance required for the Iowa analysis. We especially wish 
to thank Dr. Oliver Himley, Director, Title I Office, and his assistant, 
Mrs. Jean Smith, who spent numerous hours copying figures by hand which 
could not be otherwise reproduced. 

In Massachusetts we were aided by Dr. James F. Baker ^ Assistant 
Commissioner for Research and Development, and his assistant, Mrs. Nancy 
Badore, and by Dr. Robert Jefferies of the Title I Office. 

We wish to thank Carl Fischer of the Hawaii State Department of 
Education who provided us with that state's test scores. 

We are particularly appreciative of the constructive criticisms 
offered by Mr. Christopher Cross, Minority Counsel, House Committee on 
Education and Labor. 



Il-iv 



We are grateful to Dr. Forrest Harrison of the National Center for 
Educational Statistics for providing alternative poverty criteria. Mrs. 
Carolyn Horner of USOE's Title I Office also made available many of the 
necessary charts. Both individuals discussed the analyses with us at 
several crucial points. 

Two individuals wrote the computer programs without which the analysi 
could not have been perfected. Dr. John Wells of the University of 
California Computer Center Consulting Service and Mr. Jerry A. Smith of 
the same office spent long hours on short notice in order to produce 
necessary figures. 

Finally, we wish to thank Miss Jane Beaumont and Mrs. Olive Simms, 
who typed the manuscript, including the many tables, over and over. 



II-v 



I INTRODUCTION 



Poverty or pupil performance, which is the better criterion for dis- 
tributing money for schools? which is the better measure of that elusive 
condition, "educational need?" Which measure is the most consistent with 
Che function of schools? Which distributes school funds in the most 
equitable manner? Which is the most acceptable politically? 

These are a sample of the finance issues being argued with increas- 
ing frequency by school policymakers. The outcome of the debate is far 
from inconsequential. Upon the answers to such questions hinges the level 
of educational service to be offered to millions of school children, the 
numbers of teachers and administrators who will be hired or who will need 
to find new jobs, and the level of financing available to thousands of 
school districts across all fifty states. 

Study Purpose and Method 

Our purpose in this paper is to suggest answers to at least a few 
of the foregoing questions. Specifically, in what follows, we examine 
the distributional consequences of shifting from a poverty to a pupil 
performance allocation criterion. Our conclusions are briefly summarized 
in Section II of this report. The subsequent sections provide the results 
from the following analyses. 

By using information from 13 states that have statewide testing 
programs, we compare the flow of federal funds under the present ESEA 
Title I proverty formula with the dollar distribution that would re«:ilt 
from using pupil performance measures. Additionally, by using data frcn 
the 1970 Census, we assess the distributional consequences of changing 
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the present Title I low income formula to alternative definitions of 
poverty. By using computer simulations^ we illustrate these funding 
shifts under various allocation criteria for school districts within 
states and among states themselves. 

Within states we focus further on the degree to which Title I 
formula changes would be advantageous to cities, suburbs, or rural areas. 
Moreover, we analyze the distributional effects of poverty criteria-- 
both the present ESEA formula and proposed poverty measures — versus per- 
formance indicators for ethnic groups such as Blacks ^ Chicanos, Asians, 
and Indians. After describing the results of such analyses, we conclude 
with simulations of our own funding proposals and offer our opinions as 
to the merits and demerits of the two sets of criteria. 

Details of our findings for California only are included in the 
main text, but significantly different results from other states are so 
noted. The detailed results for the other 12 states are contained in 
the appendices. 

Background 

School policymakers and finance experts are continually engaged in 
a search for better measures by which to determine the distribution of 
school resources. To be adopted and incorporated into statute, a new 
distribution measure must wi'hstand an awesome battery of tests: for 
example, it must be rooted in readily available information, it must bear 
some relation to schools, and it must disperse funds in a politically 
acceptable pattern. 

Until recently, school finance distribution formulas that met such 
tests were relatively devoid of concern for the personal characteristics 
of pupils. The federal government, states, and local school districts 
disbursed funds as though all pupils were alike. The only widespread 
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exception to this practice occurred in states that provided additional 
funds to school districts for secondary school students ^ on the assumption 
that their schooling was^ and should be, more costly than the services 
delivered to children in the early grades. 

Passage of the Elementary and Secondary Education Act in 1965 marked 
a turning point in school finance , not only because of the unprecedented 
magnitude of the federal funds involved but also because » for the first 
time, a major distribution formula was based , however clumsily , on the 
personal characteristics of students. Embedded within the ESEA Title I 
formula was the notion that children from low income families needed 
more school services if they were to compete equally with their more 
fortunate counterparts from higher income families. The formula provided 
in a general way that school districts with concentrations of low income 
children should spend half again as much for their schooling. 

Even though passed in 1965 , the poverty component of the Title I 
formula his never been free of debate. During enactment, formula oppo- 
nents argued that income measures were inappropriate. Since 1965, the 
debate has expanded as more and more state legislatures have considered 
one or another pupil weighting provision as part of revised school finance 
formulas. The same questions arise upon occasion even within school 
districts. For example, the New York City decentralization plan permits 
schools with an added burden of pupils with low reading scores to receive 
preference in the distribution of district resources. 

In 1971, the Michigan legislature decided in favor of using test 
scores as an allocation criterion. At* present, a school district in 
that state may receive up to $200 annually for each pupil who scores in 
the 15 percentile or below on a state administered reading achievement 
test. The Fleischmann Commission, charged with examining New York State's 
educational system and suggesting directions for future policy, recommended 



a school finance formula that relied heavily upon pupil test measures: as 

1* 

a criterion for distributing state compensatory education funds. 3im« 
ilarly^ a report recommended that California implement a formula in which 
compensatory education funds were allocated on the basis of pupil test 
scores y as well as measures of their family's income. 

During the 93rd Congress , debate on poverty versus performance measures 
became more intense because of a proposal before Congress to alter the 
ESEA Title I distribution formula. In March of 1973, Congressman Albert 
Quie, ranking minority member on the House Education and Labor Committee, 
introduced H.R. 5163. This bill, among other features, proposed the 
foundation of a National Commission on Educational Disadvantage. The 
Commission was to be charged with overseeing development and administration 
of tests that would assess pupils* level of reading and mathematics 
comprehens? on within each state and for the nation a^ a whole. The bill 
intended that the results of such a testing program be substituted for 
Census and other family income measures in the Title I distribution formula. 

In congressional hearings following introduction of H.R. 51S3, a 
number of questions were raised regarding the consequences of distributing 
school aid dollars on the basis of test scores. For example, some opponents 
of the bill argued that the state of the testing art is too primitive to 
permit assessment of student learning with the necessary level of precision. 
Others expressed the fear that payments based on low achievement would 
act as incentives for school personnel either to manipulate test score 
results directly or not to teach students so that they would score poorly. 
Opposition was also voiced by those fearful that financial payments attached 
to reading and mathematics performance would distort the school curriculum 
to the detriment of art, music, history, and other subjects outside the 
strict confines of the '^3 Rs 



The references are listed after the final section of this report. 



Supporters of the Quie proposal argued that many of the previously 
cited disadvantages could be overcome.^ Moreover ^ they asserted that the 
bill had two principal advantages. Firsts it would base aid on a school 
related measure. They reminded their adversaries that the correlation 
between poverty and low scholastic performance is not perfect; not every 
child from low income families performs poorly in school. By using tests 
as an allocation criteria, money would be focused on students who most 
need it. Second, the bill's proponents pointed to the flaws embedded in 
the present income distribution criterion. Published Census data are con- 
sistently four to fourteen years out of date and AFDC poverty measures 
are not standard among and within states (Appendix A discusses these 
problems further); current practices do not prevent dishonesty in counting 
low-income children, a number of low-income children are not even enrolled 
in school — or if enrolled, do not attend regularly. 

The debate has been further complicated by those who would retain 
poverty and reject test scores as an allocation criterion, but who desire 
that a suitable alternative to the present poverty measure be identified 
and embedded in ESEA Title I. In this regard, changing the annual income 
figure from $2,000 or $3,000 to something higher — $4,000, $5,000, or $6,000 
has been suggested. More radical yet, it has been recommended that family 
annual income be rejected altogether and be replaced by a more complicated 
index of family poverty. 

At this writing (Summer 1974) , the House and Senate have compromised 
on a new measure of poverty, a combination of the so-called Orshansky 
index (see Appendix B) and a count of welfare children. Congress has 
rejected the pupil performance allocation proposals. However, the new 
Title I formula will not end the debate: This new poverty measure Is 
itself imperfect, and continued efforts to revise it are likely. In 
what follows, our intent is to inform the future debate by suggesting 
answers to questions regarding the distributional effects of the two 
allocation strategies. 

II-5 



II PRESENT POVERTY VERSUS TEST SCORE CRITERIA 



In this section we describe first the shifts of Title I funds that 
would take place within states if the original poverty formula were ex- 
changed for one based on test scores; we then describe possible shifts 
of funds among the states. 

Intrastate Analyses 

In order to simulate such change within a state, we first identified 
all states that conduct pupil testing programs,^ From this universe, we 
selected those states whose test results permitted generalizations to all 
school districts within their borders. We were then able to compare 
school districts* low income children relative to their low scoring 
children. 

A total of seven states (see Table 1) provided data upon which to 
simulate the intrastate redistributional effects of switching to a pupil 
test score criterion for Title I allocation. Summary tables of the find- 
ings for California, and interpretations of these tables, are given in the 
following pages. Highlights of our findings for the other states are also 
presented, but the summary tables for them appear in Appendix D. 

California's School Districts 

Table 2 categorizes California's school districts in terms of the 
proportion of their students at present eligible under Title I and then 
displays the gains and losses that would occur if various test score 
percentiles were used as the definition of low achievement for purposes 
of determining Title I allocations. Column 1 contains eight classifica- 
tions of California school districts, based on the proportion of their 
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Table 1 

STATES AND TESTS USED IN INTRASTATE ANALYSES 



State 



Alabama 

California 

Delaware 

Hawaii 

Iowa 

Masachusetts 
New Mexico 



Test Grade 

CAT* 4 

CTBS''' 6 

STEP* 4 
STEP 

ITBS§ 6 

CTBS 4 

CTBS 5 



Level and Form 

Level 3^ Form A 
Level 2f Form Q 

Forms 5 or 6 
Level 1 f Form Q 
Level 2f Form R 



Date Tested 



Spring 1973 
Spring 1973 
Spring 1973 

February 1973 
January 1971 
October 1972 



California Achievement Test* 

t 

Comprehensive Tests of Basic Skills. 

* 

Sequential Tests of Educational Progress (in Delaware^ modified to 
state specifications). 

§ 

Iowa Test of Basic Skills. 



students who are Title I eligible. For example^ in Row 1 of Column 1 are 
those districts that have from none to 5 percent of their enrollments 
eligible for Title I benefits. Column 2 displays the fact that there are 
a total of 264 school districts in the state which fall into the 0-5 per- 
cent category. Column 3 states that those 264 school tlistricts have a 
combined pupil population of 845 i 290. Column 4 displays the proportion 
of all California's Title I funds received by school districts in this 
0-5 percent category: these 264 districts collect 4.26 percent of all 
California Title I money. 

Columns 5 through 10 display the results of our simulations. Each 
of these columns represents a different test score criterion — percentile 
cutoff point — for statewide achievement tests. Within each column is 



displayed the proportion of California's total Title I appropriation which 
school districts in that category would receive if the low achievement 
allocation criterion were established at its particular percentile level. 
For example, in Column 5, we see that the 264 districts in the 0-5 percent 
range would receive 7.73 percent of California's Title I money if a 10th 
percentile cutoff point were used for distributing funds. These same 
districts, Column 10 shows, would receive 11,73 percent if the criterion 
were changed to the 35th percentile. Allocations at each of these per- 
centile intervals, from 10th to 35th, should be contrasted with the 
proportion appearing in Column 4, Notice that districts which at present 
have relatively few poor children (the 0-5 percent category) would benefit 
substantially in added Title funds should an achievement cri terion be 
used instead of the present income measures . 

In contrast, districts that are heavily populated by poverty children- 
35 percent or more of enrollments, as shown in Row 8--at present receive 
7.71 percent of all of California's Title I funds. Columns 5 through 10 
show clearly that these 26 districts would suffer from the proposed move 
to a low achievement allocation criterion. The proportion of statewide 
Title I funds they would receive would drop to 6.15 percent under the 
10th percentile low achievement cutoff, and to 4.53 percent under a C5th 
percentile cutoff. 

What Table 2 illustrates dramatically is that the larger a district's 
present Title I population, the larger is the likelihood that it would 
lose funds under a test score allocation formula. Why is this the case? 
Because, as we have stated earlier, there is not a perfect relationship 
between poverty and pOx • achievement (i.e., low test score performance). 
Poverty tends to be coiicentrated in a relatively small number of a state's 
school districts. Low test perfoi*mance is much more widely distributed 
throuRhout all school districts. This is particularly the case the nigher 
the low achievement cutoff point is established — that is, the closer the 
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definition of low achievement is to the 50th percentile and above, the 
greater the likelihood that Title I funds will be evenly distributed to 
all school districts in the state in proportion to their enrollments. 

These trends are depicted graphically in Figure 1. As the figure 
demonstrates, the equalizing effect of test scores increases as the cut- 
off point shifts from the loth or 15th percentile upward toward higher 
levels. In other words, if it is desired to utilize the test score cri- 
terion as an equalizing mechanism, the low achievement cutoff point should 
be set high. 

Other States' School Districts 

As Appendix D shows, the simulations for the other states — Delaware, 
Michigan, New Mexico, Iowa, Massachusetts, Alabama, and Hawaii— gave 
similar results as for California. Specifically, districts with present 
low levels of Title I eligible students would gain funds under a test 
score allocation criterion. Conversely, districts with high concentra- 
tions of low income children would generally stand to lose Title I funds 
under a test score criterion. Moreover, as can be seen in Tables 3 and 4, 
for California and Delaware respectively, it is large school districts 
that would generally lose funds and small school districts that would gain 
(although in Iowa, unlike the other states simulated, shifts of funds are 
such that large districts receive increased funds). 

Table 5 illustrates the percent changes in proportion of eligible 
children within the state, which is closely related to funding changes. 
Table 6 presents results for the same simulation, arranged by degree of 
loss: the average size of the district and the total enrollment are in- 
dicators of the importance of the category. 
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Table 6 



CALIFORNIA: PERCENT CHANGE IN DISTRICTS BETWEEN ALLOCATION 
BASED ON AFDC AND CENSUS CHILDREN 
VERSUS 18th PERCENTILE CUTOFF CRITERION 



Percent Change 


Nuitiber of 

School 

Districts 


Total 

Aggregate 

Enz*oiJjnenT 


A V%9 

Size of 

UlS vFXCC 


-100% to - 73% 


105 


23,65t» 


225 


- 75% to - 50% 


66 


87,028 


1,319 


- 50% to - 2E% 


102 


633,955 


6,215 


- 25% to 0 


150 


1,»»59,»»37 


9,730 


0 to 25% 


105 


U76,i|20 


4,537 


25% to 50% 


105 


t»51,807 


4,303 


50% to 75% 


68 


371,007 


5,456 


75% to 100% 


»»7 


235,875 


5,019 


100% to 125% 


40 


158,576 


3,964 


125% to 150% 


25 


127,793 


5,112 


150% to 175% 


22 


103,t»(»6 


4,702 


175% to 200% 


23 


62,790 


2,730 


200% to 225% 


1»» 


59,«H3 


4,244 


225% to 250% 


12 


12,000 


1,000 


250% to 275% 


5 


6,834 


1,367 


275% to 300% 


5 


15,328 


3,066 


300% to 325% 


5 


20 ,087 


4,017 


325% to 350% 


3 


i»2,211 


14,070 


350% to 375% 


6 


10,150 


1,692 


Over 375% 


107 


90,288 


844 
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Effects of a ''Hold Harmless^' Clause 

Frequently an effort is made to effect shifts in funding allocations 
over a period of time. If it is desired to shift the criterion for 
Title I funding to test scores, and if the number of children deemed to 
be academically deficient is the same in the state under the new defini- 
tion as under the current Title I criterion, additional money will be re- 
quired if some districts are not to lose funds in the changeover. In 
California this figure is approximately $21 million, or 17.41 percent of 
V\ 1974 Title I funding. Hold harmless figures for other states in our 
sample are below as the percent added on to present Title I allocations: 



The approximate average hold harmless figure for the states in our sample 
is 21 percent. These figures are based upon comparisons in the propor- 
tions of a state's present Title I children with its projected academically 
deficient children residing in the same district. The percentages do not 
take into account shifts in the relative proportions of children from 
state to state, and therefore must be regarded as rough approximations. 

Geographic and Ethnic Analyses 

Much of the original intent of ESEA Title I was to enable urban 
school districts to aid children who suffered from an educational deficit. 
Consequently, it would appear essential for policymakers to have a keen 
understanding of the effects of any formula change upon big city school 
districts. Would they gain or lose Title I funds? How would they fare 
relative to suburban and rural area school districts, and how would 



Iowa 

Massachusetts 
New Mexico 
Delaware 
California 



27.4 
16.7 

17.0 
17.4 



26.34% 
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minority children fare? Below we present a series of analyses designed 
to answer these questions. 

Data collected in 1972-73 were used to identify California school 
districts with respect to their being located in a central city, a suburb, 
or a rural area. Subsequently, we were able to compare the funding out- 
comes for these districts under the present Title I formula and an alter- 
native formula based on pupils* test scores. As with all our simulations, 
we assume the total pool of federal funds to be stable under both formulas 
(for the parti-^ular school year for which the analysis was conducted, the 
18th percentile equates with the number of poverty-defined Title I chil- 
dren. Thus, it is the relative distribution of those funds about which 
we are speaking in the following results. 

The bar graph in Figure 2 demonstrates the effect on central city 
school districts of a test score formula. It is cities with populations 
in excess of 200,000 that would experience the greatest loss of Title I 
funds. All other types of school districts, suburban and rural, would 
gain. For example, the Title I allocation for California's Los Angeles 
County would decrease 6 percent if the 18th percentile on test scores 
were used instead of pover*: measures. As the tabulation below reveals, 
the County's two largest city school districts, Los Angeles and Long 
Beach, would lose to an even greater degree, 16 percent and 32 percent 
respectively: 



Fiscal Year 1974 
Allocation 



Based on 
Allocation N 



18 



Percent 
Change 



Los Angeles County 
Los Angeles City 
Long Beach City 



Suburban Los Angeles County 



$49,768,415 
29,730,000 
2,083,000 
17,955,415 



$47,160,550 
25,089,147 
1,411,233 
20,660,170 



-6% 
-16 
-32 

+15 



Nig refers to the 18th percentile as the cutoff criterion. 
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The remainder of Los Angeles County, when one subtracts the above men- 
tioned two cities, is primarily suburban, and its school districts would 
gain, not lose, 15 percent more Title I money under a test score defini- 
tion of disadvantage. 

Table 7 displays such dollar shifts for the whole range of California 
district categories, and Table 8 shows the projected changes selected for 
major California cities. As can be seen from the latter table, these 
nine cities would lose approximately $12 million or 23 percent of their 
funds, under a shift to a test score allocation criterion. Simply put, 
large city school districts have students who are relatively more poor 
economically than they are academically, at least as measured by achieve- 
ment tests. As with our other analyses, the previous statement is even 
more accurate as the percentile cutoff definition of low achievement is 
escalated upward. 

Children from Black, Spanish-surname, and other ethnic minority 
groups who are eligible for Title I are out of proportion to their num- 
bers in the nationa . population. Consequently, we were eager to assess 
the effects upon such groups of shifting the formula from poverty to low 
achievement. Such analyses had to extend beyond the urban/suburban/rurs i 
assessment because minority group students are not always concentrated 
in large cities. For example, in California, the major portion of the 
Spanish-surname population is located in the agricultural valleys and 
other rural areas. In many Southern states, the Black population is 
found rather evenly distributed between cities and agricultural areas. 
Thus, in order to determine the relative distribution effects upon 
minorities, we chose school districts in both Alabama and Calif oriia 
for these analyses. 

The results are not as clear as in our other analyses^ although for 
California there definitely would occur a funding shift. The bar graph 
in Figure 3 demonstrates that when all minority group students are 
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35 Pctl. 

% Change 
(11) 


a>p)in:riH(*)aor^<p 

CMr>€M<D CMCMCMH 
1 4* 1 4- 4- 4- 4- 


30 Pctl.| 

% Change 
(10) 


V« CO CM u> W 01 CM H 
1 1 ♦ ^ 1 ♦ ♦ ♦ ^ 


V 6 ^ 
u e O) 
0- g ^ 

in 

CM 4^ 


1 1 ♦ ♦ ♦ ♦ ♦ 


20 Pctl. 

% Change 
(8) 


OGOCOC7)r-tCMC7)^(r) 
CMCOH:r HC^CMH 

1 1 ♦ ♦ 1 ♦ ♦ ♦ ♦ 


15 Pctl. 

% Change 
(7) 


U|Otf)<HHOtf)CMO> 

r« CO H :r CM CM 
1 1 ♦ ♦ 1 ♦ ♦ ♦ ♦ 


10 Pctl. 

% Change 
(6) 


1 1 4- 4> 1 4* 4- 4* 4* 


Pet. of 
Title I 
Popula. 

(5) 


HflOCOiHinCMCMCMin 
CO CM r4 A 


Pet. of 

state 

Enroll. 

(u) 


oiincoco(n(Ocoa)in 

H CO H H 


Total 
enrollment 

(3) 


828, OUS 
209,016 
l,<fr70,810 
116,109 
6U7,989 
118,018 
121,386 
706, UOl 
230,325 


No. of 
School 
Dists. 

(2) 


co:rco:r(OU)(OCM<H 

tf> CO CO CO ^ <^ ^ 


Category 
(1) 


1. Large Cities 
O^.^r 500,000 Pop. 

2. Large Cities 

200,000 to 500,000 Pop. 

3. Suburbs 

Of Large Cities 
U. Rural Areas 

Near Large Cities 
5. Hedium-Sized Cities 

50,000 to 200,000 Pop. 

Suburbs 

Of HediuiKSized Cities 

' • t\ur'ax AZ^as near 
Medium-Sized Cities 

8. Small Towns or Cities 
Under 50,000 Pop. 

9- Rural Areas Not Near 
Large or Medium Cities 
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Table 8 

CALIFORNIA: CH/\NGES IN ALLOCATIONS 
FOR SELECTED CITIES 



City 


1 

Fiscal Year 
197H Ratably 


% Change 


Allocation 
Based on N^g 


Loss 


Oakland 


$ 3,952,000 


-»*2 


$ 2,292,160 


$ 1,659,840 


Fresno 


2,035,000 


-W 


1,180,300 


854,700 


Long Beach 


2,083,000 


-32 


1,416,440 


666,560 


Sacramento 


1,979,000 


-48 


1,029,080 


949,920 


San Bernardino 


1,541,000 


-21 


1,217,390 


323,610 


San Diego 


3,245,000 


-15 


2,758,250 


486,750 


San Francisco 


U,259,000 


-39 


2,597,990 


1,661,010 


Stocktcn 


1,586,000 


-20 


1,268,800 


31V,200 


Los Angeles 


29,730,000 


-16 


24,973,200 


4,756,800 


Total 

Loss « overall 


$50, 410 ,000 
23% 




$38,733,610 


$11,676,390 
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aggregated and compared with white students, the test score allocation 
criterion would provide school districts with low percentages (from none 
to 10 percent of minority group children with 4 percent more of that 
state's total Title I appropriation than is now the case, Conversel> , as 
one would expect, sch'^ol districts with a high proportion of their student 
body from minority groups (50 to 60 percent) would lose about the same 
amount of funds under a test score formula. Table 9 shows the detail of 
this finding « 

When the ethnic minority makeup ox school districts is divided more 
linely, we see that those districts that have more than 10 percent of 
their students who are Black would lose funds (Table 10). However, the 
findings on districts with varying proportions of their student body 
comprised of Spanish-surname students are more complicated (Table 11). 
For example, districts with less than 10 percent of their eni'ollment 
Spanish-surname would gain under a test score allocation criterion. The 
findings are similar for districts with a very high proportion (more than 
80 percent) of their students Spanish-surname. However, the findings are 
very uneven in between, some categories of districts gaining and some 
losing funds. This is probably a consequence of the fact that, in 
California, Spanish-surname students are more evenly distributed than 
Black students between cities, suburbs, and rural areas. 

Similar analyses for Alabama are also complicated. As displayed in 
Table 12, those districts in Alabama which are relatively devoid of Black 
students (less than 20 percent Black) or who are relatively devoid of 
White students (more than 50 percent Black) lose funds under a test score 
criterion. However, districts whose student bodies are relatively in- 
tegrated between Black and White students would gain Title I funds. We 
are not sufficiently familiar with Alabama to offer an explanation of 
this mixed condition. However, we view it as far too complicated ever 
to be used as an argument that integration lowers test scores. 
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Interstate Analyses 

In this set of analyses our intent was to determine the degree to 
which ESEA Title I funds would be redistributed among states should pupil 
test scores, rather than family income, be used as an allocation crite- 
rion. These simulations are based upon data obtained from 13 states. 
The states involved and the important information regarding their test- 
ing programs are listed in Table 13. 



Table 13 

STATES AND TESTS USED IN INTERSTATE ANALYSES 



Testing 



state 


Test, Level, Form 


Grade 


Season 


Alabama 


SRA Achievement, Level 3, Form A 


4 


Spring 


Arizona 


Metropolitan, Primary II-H 


3 


Fall 


Arkansas 


SRA Achievement , Form F 


6 


Spring 


California 


CTBS, Level 2, Form Q 


6 


Fall 


Hawaii 


STEP, 1957 ed.. Form 4-A 


4 


Fall 


Iowa 


ITBS, II , Form 5 or 6 


5 


Fall, winter 


Mississippi 


CAT, Level 3, Form A 


5 


Spring 


New Hampshire 


SAT, II, Form X 


6 


Fall 


New Mexico 


CTBS, Level 2, Form R 


5 


October 


North Carolina 


ITBS, II , Form 5 


6 


Spring 


North Dakota 


ITBS, II, Form 5 


5 


Fall 


Rhode Island 


ITBS, II, Form 5 


4 


Fall 


Tennessee 


Metropolitan, Form F 


5 


Fall 



It is impossible to determine ii^terstate disvribution patterns pre- 
cisely without having the entire 50-state universe under observation. 
However, the 13 states in our sample are distributed among the six re- 
gions—East, Northeast, South, Midwest, Southwest, and West— and, con- 
sequently, we are hopeful that our simulations provide a reasonably ac- 
curate national prediction. 
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Our interstate analyses would have been impossible without the 
availabil-* ty of data from the **Anchor Test Study*' (ATS) conducted in 
1972 by the Educational Testing Service. Results from this study enabled 
us to interpolate between scores of the various tests used by the 13 
states in their statewide testing programs. The Anchor Test provided the 
linkage between publishers' norming tables and raw score distributions. 
(A more exhaustive description of the Anchor Test Study and its utility 
for our analytic purposes is provided in Appendix C.) 

The following tables summarise the results of the interstate analy- 
ses. Column 1 of Table 14 lists the 13 states involved in our simulations, 
Column 2 displays the state's total enrollment, Column 3 contains the 
number of Title I eligible students who reside within the boundaries of 
the state under consideration, and Columns 4 through 9 display the num- 
ber of the Title I eligible children who would be found within the state's 
boundaries should varying test score percentile cut off points be used 

as the allocation criterion. 

't 

Table 15 displays the percentage change in funding when different 
test score cutoff points are employed as Title I criteria. These per- 
centages reveal the degree to which a state would gain or lose eligible 
students and thus gain or lose Title I funds. 

Subsequent tables simply explain the above findings in greater de- 
tail. Table 16 shows the percent changes in the states' proportions of 
the national Title I. Table 17 displays the percent of students in a 
state scoring below specified Anchor Test Survey cutoff percentiles, and 
thus explains why the number of eligible students in a state would vary 
from one percentile level to another. 
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Table 17 



INTERSTATE COMPARISON: PERCENT OF CHILDREN 
SCORING BELOW ANCHOR TEST CUTOFF POINTS 



Anchor Test Score Percentile Cutoffs 
Defining Lew Achievers 



State 


10 Pctl. 


15 Pctl. 


20 Pctl. 


25 PCtle 


OU rCtX. 


qc 1 
09 rcxx« 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


Alabama i 


23eO0 


OOeHZ 


OOe X / 


tiA 7C 
*T^e /D 






Arizona' 


5e50 


9*93 


X**eH7 


XOe 03 


oi no 


O^ 


Arkansas 


lleU9 


Id* 53 


O ^ 111 


AO li A 


OC 1 1 


09 . 9X 


California 


11.00 


18e00 


A A A A 

22e00 


AC A A 


Ol HA 


OA nn 
oo. uu 


Hawaii 


8.70 


13.00 


16.00 


AA A A 


OC AA 


ofi nn 


Iowa 


1.67 


il.OO 


5.33 


9e33 


12.00 


15.35 


Mississippi** 


31.56 


10.90 


51.72 


59.75 


65.20 


70. 7»* 


New Hanpshlre^** 


7.52 


9.95 


13.58 


18e90 


23.20 


26.21 


Neir Mexico 


13.06 


23.1i» 


28.07 


33.11 


39.83 


i»5.i»0 


North Carolina 


13.00 


20.00 


25.00 


31.00 


39.00 


»*2.50 


North Dakota 


i».39 


9.!;8 


12.50 


14.14 


20.10 


25.i|6 


Rhode Island 


1 17.92 


25.60 


25.72 


26.02 


26.50 


35.24 


Tennessee 


1 10.00 


17.00 


22.00 


31.00 


39.00 


45.00 



*Metrq>olltan Percentiles (more accurate » we think f than Anchor In 
this case). 

**Only 85 percent of districts In Mississippi participated. 
***Te8ts administered to only 46 percent of New Hampshire children. 
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Based on our analysis of 13 states , there appears to be a slight 
tendency for Southern and New England states to benef i t from a test 
score allocation criterion . With a low percentile cutoff, Arkansas, 
Mississippi, Tennessee, and New Mexico (tr> the degree to which it can 
be taken as a southern state) benefit. Similarly, Rhode Island and 
New Hampshire would profit. As the test score cutoff point moves to 
higher percentile levels, only Southern states consistently continue 
to benefit, 

Arizona, California, Iowa, and North Dakota are rather consistent 
"losers" regardless of where the cutoff point is drawn. This means 
that, on balance, these are states more plagued by poverty than by poor 
achievement. Hawaii remains relatively stable, and would neither win 
nor lose substantially should a new formula be put into effect. 

Why should these shifts of funds between states come about? What 
is it about an achievement test score criterion that would enable 
Southern and New England states to profit more, than under the present 
poverty formula? The likely answer is not to be found so much in any 
abnormally low performance of Yankee and Confederate students. Rather, 
it is more likely a consequence of the imprecision of the poverty 
measures in these regions. In ways we will explain later in this report, 
the present Title I low income formula probably understates the poverty 
conditions of school districts in these two parts of the nation, at least 
partially because of the lack of uniformity with which states allocate 
AFDC payments. The shift of funds to Southern states under a test score 
formula are simply compensating for what an accurate poverty measure 
would otherwise do. If the Title I formula were changed so as to measure 
poverty more precisely, it is not likely that there would be an inter- 
state funding redistribution of any sizable magnitude. 



11-35 



Partial Summary 

The discussion and subject matter throughout this paper has a 
tendency to become complicated. Consequently, at selected intervals 
we provide summaries of the findings to that point. Here our concern 
is with the major results from our simulations of the distribution 
patterns of the present ESEA Title I poverty formula compared with a 
test score allocation criterion. Within a state: 

• Use of pupil test scores for ESEA Title I allocation purposes 
would place Title I funds in a different, though overlapping, 
set of school districts than is the case under the present 
income based formula, 

• This is so because the incidence of low achieving children 
is more uniformly distributed across school districts than 
is the incidence of children from low income families. 

• A test score allocation criterion would decrease the propor- 
tion of Title I funds flowing to large city school districts 
unless a concentration criterion or a hold harmless clause 
were incorporated, 

• In general, it appears that a hold harmless provision would 
cost at least 20 percent above what currently is being 
appropriated under the Title I poverty formula, 

• Districts with large numbers of Black students, at least in 
California, would lose Title I funds under a test score 
criterion. The picture is substantially more mixed for 
Spanish-surname children* 

• Southern school districts with roughly equal numbers of 
Black and White students would tend to benefit financially 
from a test score allocation criterion. Conversely, racially 
segregated districts would suffer under such a criterion. 

Between the states analyzed: 

• Using present ESEA Title I formula components for a comparison 
a test score distribution criterion would benefit Southern and 
New England states; they would profit from a shift of funds 
from the Western states. 
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The reason for such a shift is not that Southern students 
are disproportionately low scoring^ but that present poverty 
measures are imprecise when applied to states in these 
regions, understating the extent of deprivation. 
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Ill PRESENT VERSUS ALTERNATIVE POVERTY CRITERIA 



Debate regarding the comparative validity and utility of test scores 
and poverty measures is not limited to present Title I fomula components. 
Indeed, Congress itself always was unhappy with the original $2,000 def- 
inition of a low income fami'ly. The statutory definition was altered to 
$3,000 in 1970, but a proviso was attached that the higher figure was 
not to be used until appropriations were sufficient for full funding under 
the $2,000 definition. Subsequently, suggestions were made repeatedly 
that even the $3,000 figure was too limiting, given growing inflation 
rates. Recommendations for poverty criteria all the way to the $6,000 
level have been seriously made. 

Another component of the present ESEA Title I formula has been 
subjected to heavy criticism— this is the use of AFDC children as a 
measure of poverty entitlement. Appendix A explains the controversy in 
detail. Suffice it to say here that AFDC critics assert that it is a 
far from uniform measure. Welfare eligibility regulations and adminis- 
trative policies vary sufficiently among and within states to skew Title I 
allotments systematically. 

Income levels, regardless of the precise dollar figures involved, 
and AFDC payments have themselves come in for criticism, primarily on 
grounds that they are overly simple and subject to regional and other cir- 
cumstantial disparities. As a consequence of such criticisms, it was de- 
cided in 1974 that the ESEA would embody a more complicated poverty measure, 
the so-called ''Or^hansky index. In this section we present the results 
of our comparative analyses of the present Title I formula with alternative 
poverty measures, including Orshansky. However, before turning directly 
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to this task, we digress briefly to explain the Orshansky formula (a 
more extensive description is provided in Appendix B) . 

Orshansky Poverty Index 

In response to concern regarding the inadequacy of earlier poverty 
measures, the Social Security Administration, through the efforts of 
Mollie Orshansky of the Division of Research and Statistics, proposed a 
new measure of poverty in 1965 • Since the most seiious flaw in the 
previous standard had been Its failure to recognize family size, 
Ms. Orshansky attempted to take this additional variable into account. 

In establishing a new poverty index, Ms. Orshansky derived a stan- 
dard based on the level of income remaining after basic food needs were 
met. The foundation for determining the dietary needs of a family was 
the ''economy food plan* of the U.S. Department of Agriculture^, which is 
a nutritionally adequate food plan designed for ''emergency or temporary 
use when funds are low." Families with two members were defined as 
poor if food costs as a percent of total income was less than 28 percent, 
for families of three or more, the ratio was 33 percent. Farm families 
were assumed to need only 70 percent as much cash income as non-farm 
families. 

Two revisions were made to the Orshansky poverty index in 1969. 
Henceforth, annual adjustments in levels would be based on changes in 
the Consumer Price Index, rather than merely on changes in the cost of 
food included in the economy food plan. A second change raised the 
farm income threshhold from 70 to 85 percent of the corresponding non- 
farm level. 

Dissatisfaction has been directed at the Orshansky index from 
both sides. For example, the President's Commission on Income Maintenance 
Programs charged that adequate nutritional standards cannot be met under 
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the economy food plan for any period of time, and that the food-to- 
income ratio does not provide sufficient money for necessities. On the 
other hand, a recent GAO study contends that the Orshansky Index ex- 
aggerates the Incidence of poverty by not Including non-cash benefits 
In defining Income, The GAO maintains further that the Index's? re- 
liability Is reduced by Its failure to differentiate cost-of-living 
levels between metropolitan areas and small towns* 

Despite such criticisms, the Orshansky Index generally Is viewed as 
a more valid measure of poverty than Is a simple measure of annual Income. 

Selecting from Alternative Poverty Criteria 

The many weaknesses of the present system of measuring poverty 
for ESEA purposes prompted a search for a better allocation yardstick. 
Below we present the results of two types of simulation analyses — changes 
among school districts within states and changes among states themselves — 
using five poverty criteria: the '^Orshansky' index and family annual 
income levels of $3,000, $4,000, $5,000, and $6«000, 

Intrastate Changes 

District by district shifts within the states of California, Massa- 
chusetts, and Alabama are presented in Tables 18, 19, and 20 « For Massa-- 
chusetts and California the picture is distinct, provided the level of 
funding remained constant: districts that currently received a relatively 
low level of Title I funds would gain, and districts that are funded at 
a relatively high level would lose money under shifts to alternative 
poverty criteria « Indeed, in the case of Massachusetts, many smaller 
districts would benei'it at the expense of Boston* 

In Alabama, the redistributions are not as evident as in the above 
two industrial states* As shown in Table 20, changes do not follow any 
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ERLC 



discernable pattern. Furthermore, they are smaller in the dollar amounts 
involved than the changes in either Massachusetts or California. 

Generally, the shift from the present system to any of the alternative 
poverty criteria under consideration would have an equalizing effect 
similar to the effect of shifting to test scores: compare Figure 4 to 
Figure 1. The higher the poverty level, the more nearly the proportion 
ol target children is equal tliroughout a state's districts. 

Interstate Changes 

^Vhat would happen to total state allocations if other poverty criteria 
were chosen? An analysis was performed to assess variations in proportions 
of poor children under different poverty criteria. Detailed state-by- 
state results are displayed in Appendix D; a regional summary is presented 
in Table 21; the results for each region are shown in Tables 22 through 
29. It must be remembered that these figures represent changes in the 
number of children, rather than in actual dollar amounts. However, 
dollar allocations are obviously affected since they are related to the 
size of a state's target population. 

As shown in the regional summary (Table 21), substantial gains would 
accrue to the Southeast and Southwest regions (83 and 71 percent respec- 
tively) under the Orshansky poverty index while large losses would occur 
in the New England, Pacific, Great Lakes, and Mideast regions. The six 
states sustaining the largest decrease under the Orshansky index are 
Michigan (43 percent), Illinois (42 percent). New Jersey (49 percent). 
New York (48 percent), Massachusetts, (46 percent) and Connecticut 
(42 percent). Notice that these are all industrialized states. 

These shifts are intimately related to the variation in the quality 
and scope of state AFDC programs ♦ States that at present have poorly 
funded welfare programs also have relatively low Title I funding. The 
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substantial dollar redistributions involved in the shift to any of the 
alternative poverty criteria reflect shifts needed to strengthen Title I 
programs in states with poor AFDC pre grains. These states are principally 
located in the South, and to a lesser degree in the Plains and Rocky 
Mountain areas* 

Partial Sunmiary 

It is easy to find fault with the details of the original Census 
data and AFDC distribution formula for ESEA Title I. Even if Census 
data were frequently updated, many frailties would remain. For instance, 
the use of an annual family income level of $2,000 is a na*£ve definition 
of poverty. Inclusion of AFDC children is insufficient to rectify the 
condition, AFDC payments are far from uniformly administered within and 
among states. 

Efforts to utilize alternative poverty criteria so as to correct 
for such weaknesses eventuate in the following general conditions: 

• Alternative poverty criteria — the Orshansky index and annual 
family income levels of $3,000, $4,000, $5,000, and $6,000— 
all have approximately the same redistributional effects when 
compared with the present Title I allocation patterns. Specif- 
ically, they shift funds from districts that are now heavily 
populated by poor children to ones which have smaller concentra- 
tions. This is particularly true for industrialized states 
with large urban populations. For example, in Massachusetts, 
Boston would lose a substantial proportion of its present 
Title I funds to the other districts in the state. 

• The interstate redistribution patterns accompanying a change in 
poverty formula components is similar to the intrastate pattern. 
Specifically, industrialized states would have their share of 
Title I funds reduced at the expense of less industrial states, 
primarily those in the South and Southwest, 



11-56 



IV ALTERNATIVE POVERTY VERSUS TEST SCORE CRITERIA 

When each Is compared with the present ESEA Title I allocation for- 
mula ^ both test score and alternative poverty formula have redistribution 
effects. Each of these two formal strategies shifts Title I funds from 
present high poverty districts. However, In this section we pose a dif- 
ferent question: *Vhat Is the distributional effect when test scores are 
compared with the alternative poverty measures?^' 

To answer this question we conducted Intrastate simulations for 12 
states (our original 13 minus Hawaii). The alternative poverty measures 
are the Orshansky index and annual family incomes of $3,000, $4,000 and 
$6,000. As in all our simulations, we have assumed a constant dollar 
appropriation level for Title I funds. Thus, the shifts describe.! are 
shifts in proportions of eligible students. (Complete simulation results 
are provided in Appendix D.) For this discussion, we limit ourselves to 
the findings for Massachusetts. 

Massachusetts* School Districts 

We first categorized the districts by the proportions of their enroll- 
ment who are defined as poor under the Orshansky index, and then observed 
the changing proportions of this ''eligible** population at varying test 
score percentile cutoff points. Table 30 shows that in Massachusetts, it 
is the relatively low poverty districts (those with fewer than 10 percent 
of their students qualifying as Orshansky poor) that would gain Title I 
funds. All other distxicts would suffer a decrease in the number of 
eligible students. 
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When annual family income levels of $3^000^ $4^000, ard $6,000 are 
utilized instc of the Orshansky index, the result is essentially the 
same: as Tables 31, 32, and 33 show. Title I funds would still shift in 
a substantial way from high to low poverty districts. Specifically, it 
is those districts with fewer than 10 percent of their students poverty- 
eligible that would profit from a test score criterion. In Massachusetts, 
such low poverty districts contain over half the staters total student 
enrollment. The net effect of a switch to Orshansky or a higher annual 
income poverty definition is to move Title I funds away from those units 
now benefiting from them and into a much wider array of school districts. 

An interesting sidelight stemming from these analyses is the fact 
that, when compared with the Orshansky index or a higher income definition 
of poverty, the test score allocation criteria are not as sensitive to the 
percentile cutoff as is the case when test scores are compared with the 
present Title I formula. Apparently, most low test scores are embedded 
within the population of children whose families have an annual income 
of under $6,000. Thus, when the test score percentile cutoff is elevated 
upward toward the mean, there is very little net effect upon funding dis- 
tribution patterns. 



Partial Summary 

Alternative poverty criteria—the Orshansky index and higher annual 
family income levels and test scores have much the same redistributional 
effects. Specifically: 

• When compared with either Orshansky measured poverty or 
higher income level poverty, test scores will concentrate 
Title I funds in districts with low proportions (under 

10 percent) of poor students. 

* A test score allocation criterion^ when compared with 
Orshansky and higher income level poverty, would redistri- 
bute Title I funds to a wider number of school districts 
throughout a state. 
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The test score percentile cutoff defining low achievement 
is not a sensitive matter when compared with alternative 
poverty measures. Whether low achievement is defined at 
the 10th or the 35th percentile has relatively little 
effect upon allocations* 
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V POVERTY AND TEST SCORE STRATEGIES COMBINED 



Given that poverty and pupil performance measures have both advantages 
and disadvantages, is there any way that they can be combined to overcome 
their weaknesses and maximize their strengths? If such a linkage were 
made, it might be possible to surmount logistical and administrative weak- 
nesses of the two types of eligibility measures. Moreover, combining the 
two would take official cognizance of the fact that, though practically 
linked, the two phenomena are conceptually distinct. 

A child from a low income family may or may not be academically 
deficient. Regardless of academic standing, it is likely that the low 
income student has not had the social opportunities that would permit 
fulfillment of his or her potential. ^Compensatory education'' in this 
instance is directed at making up a deficit. Whether it is a deficit 
between the student's present low achievement and some minimum expected 
level of school performance or between the student's actual and potential 
level of performance is, in this instance, inconsequential. 

Similarly, an academically deficient student may or may not come 
from a low income household. All that is important from society's stand- 
point is that the individual has not reached some minimally acceptable 
level of school performance and, therefore, is in need of compensatory 
education. The presence or absence of poverty may shape the kind and 
amount of the compensatory services made available to the child. However, 
the presence or absence of proverty should not determine whether or not 
an academically deficient child receives needed school services. 
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Double-Counting Formula Simulations 

Tables 34 through 39 display the results of two different kinds of 
double-count formula simulations. For six states — California, Alabama, 
Delaware, Iowa, Massachusetts, and New Mexico — we analyzed the conse- 
quences of permitting school districts to determine Title I eligible 
students by counting both their poverty and their low achieving pupils. 
It is the sum of these two enumerations that would provide the Title I 
eligibility figure. The initial set of simulations (Tables 34 through 3C) 
of such a double-count formula was done with the original Title I formula 
(Census data plus AFDC) as the poverty measure. The second set of double- 
count simulations (Tables 37 through 39) was performed using the Orshansky 
index. Within both sets of simulations, a further breakdown by urban/ 
suburban/rural area and by ethnic group is provided. 

The end product of both sets of double-count simulations is a sug- 
gestion of the relative amount of additional Title I funds that would be 
necessary if poverty and poor performance were incoporated in the allo- 
cation formula. Under the first double-count simulation (Census data 
and AFDC plus low achievement) the number of Title I eligible students 
would increase by 123 percent. Under the second set of simulations 
(Orshansky index plus low achievement) the number of Title I eligible 
students would increase by 144 percent. 

Almost by definition, all states and almost all districts within 
states would benefit financially under a double-counting formula. Among 
states, Alabama would profit most from using a double-count formula based 
on the Orshansky index. As we have discussed previously, this is probably 
tnip because of the low funding level of Alabama's welfare programs, which 
understate the number of poverty children at present. This inadequacy 
would be compensated for under the Orshansky index, thus resulting in 
the relatively greater gain for Alabama under a double count, 
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Within states, it is the districts that currently have low concen- 
trations of poverty children which would benefit most from a double count, 
Again, somewhat obviously, these districts would benefit because poverty 
and low performance are not identical. Low poverty districts, when per- 
mitted to determine eligibility by low achievement, would thus gain in 
their numbers of eligible students. The finding holds for the analyses 
by type of school district and by ethnic group. Within California, it 
is the suburban and rural school districts, and those with the lowest 
concentrations of ethnic minorities, that would gain the largest number 
of eligible children under a double count. Dy the nature of double 
counting, this would occur without diminishing the number of eligible 
students in cities or in other districts with high concentrations of poor 
and low achieving students. 

If a double-count formula were fully funded, it would expand the 
base of Title I without eroding allocations to present recipient districts. 
However, it would be extraordinarily expensive to fund. Total Title I 
appropriations would have to increase by more than $2 billion over present 
funding levels. For this reason we move to our next four aets of simu- 
lations; they demonstrate ways in which both poverty and low performance 
can be utilized, but with substantially lower amounts of federal money 
being required. By so doing, we hope to suggest a politically more 
feasible set of strategies. 

Alternative Counting Simulation s 

The following four sets of simulations were designed to illustrate 
the potential effects of alternating poverty with student performance as 
Title I allocation measures. In essence, there are two major simulations 
here, each of which has two variations. First, we simulated the distri- 
butional consequenr j of permitting school districts to compare the number 
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of eligible students under the present Title I poverty formula (children 
from families with annual incomes below $2,000 and AFDC children) with 
the number of children who would be eligible under a test score definition. 
The district would then be able to select the greater of the two eligibility 
numbers. In performing this simulation, and all the other simulations 
we discuss in this section as well, we assume that the total Title I dollar 
appropriation would be increased to encompass the additional numbers of 
eligible students. In effect, then, permitting districts to choose the 
higher of the two eligibility calculations would ''hold harmless" the 
allocations for all school districts. No district could receive less 
Title I money than is currently the case; however, a substantial number 
of districts would gain. 

The second major set of simulations contains the essence of number 
one above, but replaces the present Title I poverty formula with the 
Orshansky index. In short, in this analysis, a school district chooses 
the greater number of eligible students according to (1) Orshansky poverty 
or (2) academic deficiently as defined by test scores. 

Within each of the above-described sets of simulatioiis we performed 
two additional analyses. F rst, we made each school district compute its 
Title I entitlement on the sole basis of poverty students, u nless its 
academically deficient students exceeded 16 percent of its student en- 
rollment . If tue latter condition were the case, then the district could 
base its Title I entitlement on the larger figure— poverty eligible students 
or test score eligible students. This analysis was conducted using both 
the present Title I proverty formula and the Orshansky index. Again, we 
assumed that total Title I dollar appropriations would be expanded to 
fund the additional eligible students. 
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The rationale behind this subset of simulations is that, within 
reason, districts should expect to absorb the costs of compensating to 
some degree for low performing students. However, when the proportion 
of these students reaches a critical point~16 percent was arbitrarily 
selected as an illustrative level — then the district is. faced with an 
added burden of such proportions as to justify external funding assis- 
tance. Of course, this decision rule of having a district absorb a spec- 
ified level of academically deficient students on its own resources could 
also be made to hold for low proportions of poverty students. However, the 
political unreality of such a move argued against our attempting to sim- 
ulate a poverty absorption decision rule for this set of analyses. 

In addition to simulating the 16 percent low performance absorption 
rule for each of the two major analyses (present poverty and alternative 
poverty measures), we analyzed the consequences for California of all four 
formula variations upon urban school districts and ethnic minorities. 
The resultant alternative counting simulations distribute themselves over 
the following categories: 



Simulation I 



Simulation II 



Present Poverty 
+ Low Performance 



Effect on 

urban 

areas 



Effect on 

ethnic 

minorities 



No 

absorption 
of 
low 

performance 



Alternative Poverty 
+ Low Performance 



Effect on Effect on 
urban ethnic 
areas minorities 
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Simulation III 



Simulation IV 



Present Poverty 
+ Low Performance 



Effect on 

urban 

areas 



Effect on 

ethnic 

minorities 



16% 
absorption 

of 

low 
performance 



Alternative Poverty 
+ Low Performance 



Effect on 

urban 

areas 



Effect on 

ethnic 

minorities 



Simulation I Results 

Table 40 displays the funding increments, by category of poverty 
impacted school dirtrict, for the six states for which these analyses 
were conducted. Poverty in this instance is measured by the present 
Title I formula ($2,000 and AFDC) . We can see that when districts are 
permitted to select either their poverty eligible or test score eligible 
students, they gain substantial funds. The range of additional funds is 
from a low of 5.96 percent for the state of Iowa to 164,31 percent for 
Alabama.* The latter figure reflects the failure of the present Title I 
formula to measure accurately Alabama's poverty, and thus permits the 
state to profit greatly from disproportionately large numbers of low per- 
forming children. 

Table 41 demonstrates for California that the alternative count 
Title I formula would bring substantial benefits to small cities, suburbs, 
and rural areas. These are the types of school districts that would gain 
the most new money under such an arrangement. Large city districts would 
gain only slightly. As we have already stated, under this funding arrange- 
ment, no districts would lose funds. 



See the note in Appendix C regarding Iowa figures for Simulations I, 
III, and V, 
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ERIC 



Table 42 displays the fact that it is school districts with the 
lowest concentrations of minority students that would gain Title I funds 
under the alternative counting stragety« 

Simulation II Results 

From Table 43 it can be seen that the use of the Orshansky index, 
instead of a $2,000 annual income level and AFDC children, markedly changes 
the funding pattern. In that it is based on a higher level of annual 
family income, the Orshansky index, it has the effect for much of the U.S. 
of increasing the number of poverty eligible children. By permitiing 
districts alternative counting, we are enabling many of them to select 
from among two figures, both of which are frequently higher than their 
present number of eligible students. Thus, we see in Table 43 that all 
our sample states, except Alabama, gain Title I funds under the Orshansky 
index alternative count. 

Tables 44 and 45 show that, for California, the Orshansky alternative 
count formula benefits urban school districts and areas with hi4:;h concen- 
tractions of minority group children. Again, this would appear to be 
the effect of the more generous definition of poverty embedded in the 
Orshansky index compared with the present formula. 

Simulations III and IV Results 

Tables 46 through 51 present the results for the remaining two sim- 
ulations. For both the present poverty formula and the Orshansky Index 
we have analyzed the consequences of making school districts absorb the 
first 16 percent of their academically deficient students. Consequences 
of the absorption rule are twofold. First, it decreases the total amocnt 
of additional Title I money that would be necessary to fund the newly 
eligible students under alternative counting. This is particularly true 
under the present formula (Simulation III) . 
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Second, particularly under Simulation VI, large urban districts and 
those areas with high concentrations of ethnic minority students would 
gain the most. The absorption rule operates to the disadvantage of sub- 
urban and small city school districts, which have some academically 
deficient students but do not suffer from high concentrations of them. 
When asked to pick up the costs of compensatory education for such students, 
these little-burdened districts profit little from alternative counting. 

Holding Harmless 

By weighting a state's average Title I gain by the number of pupils 
within its borders, we were able to calculate the five-state total mean 
for additional Title I funds under the four alternative counting and two 
double counting simulations.* These figures, shown below as the percent 
added on to present Title I funds, can be viewed as the costs, to the 
six states for which we conducted simulations, of holding all districts 
harmless : 



As can be seen from the above figures, the costs of double and alternative 
counting range from approximately 30 to 140 percent more than currently 
is spent for Title I. In dollar terms, this means that Congress would 
need to appropriate from $480 to $2,160 million over the present total 



Double counting 



Alternative counting 




^Simulation III 

Simulation IV 

Simulation V 

^Simulation VI 



123.25% 
143.79 
42.32 
54.56 
32.71 
38.97 



Iowa is excluded from this average because of the problems previously 
mentioned and explained in Appendix C. 
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national Title I funding level. Of course, these figures can be manipu- 
lated easily by varying components such as the absorption rate in Simula 
tions III and IV. 

Partial Summary 

By constructing a formula that permits school districts to double 
and alternatively count students — those eligible by a poverty definition 
and those eligible by a test score definition — and to take the total of 
the two or the higher of the two, we recognize that both poverty and low 
pupil performance are problems affecting the nation and worthy of nation 
wide efforts to alleviate. The double and alternative counting simula- 
tions gave the following results: 

• Double counting holds all present districts harmless and 
adds substantially to districts that now are low in the 
proportions of their enrollments from low income and 
minority households, 

• When the alternative-counting is accomplished with the 
present Title I formula, big city school districts would 
gain only a small amount, whereas small cities, suburbs, 
and rural areas would gain a great deal* 

• Under a double counting formula, using the Orshansky 
index. Congress would have to increase the present 
Title I appropriation by more than 140 percent, 

• When the poverty formula is switched to the Orshansky 
index and alternative counting is permitted, city school 
districts as well as high minority districts benefit 
more than suburban and low minority districts. 

• Under the Orshansky alternative counting simulation. 
Congress would have to appropriate over 50 percent 
more money than at present. 

• If districts were asked to absorb the first 16 percent 
of their academically deficient students, the number 
of additional eligible students is reduced : only 

30 percent more Title I money would be needed. 
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Under the Orshansky alternative counting formula and the 
16 percent absorption rule, slightly more than 36 percent 
more money would be necessary. 

Under any of the double-counting alternative counting 
formulas tested, no districts would lose federal funds. 
They would, in effect, all be held harmless. 



11-92 



VI CONCLUSIONS AND RECOMMENDATIONS 



Several significant findings emerged from our simulations. 

First, by themselves, both pupil performance indicators and alterna- 
tive poverty measures (the Orshansky index and higher family annual income 
levels) have an equalizing effect upon the distribution of Title I funds: 
that is, these new formula components would tend to shift compensatory 
education dollars from areas where they are now concentrated to a larger 
number of districts, frequently at the expense of big city districts and 
those high concentrations of minority students, (This finding is based 
on the assumption that the pool of Title I funds remains relatively 
stable and that there is no **hold harmless** provision incorporated into 
the law.) Thus we would argue against the inclusion of either simple 
formula revision. 

Despite a tendency to dilute the distribution of Title I, the pupil 
performance measures appear to identify an additional population of stu- 
dents in need of more intensive schooling. It can be argued that if these 
students are permitted to remain academically deficient, they may become 
a hardship on the entire nation. Thus, federal intervention is Justified. 
However, to do so at the expense of children from low income households — 
even if the latter are not found to be academically deficient by the 
limited measures now being used — strikes us as unwise and unfair. Stu- 
dents from poor families frequently need additional services if they are 
to realize the full potential of their capabilities. If such services 
are not provided by government, they are not likely to be provided at all. 

Why not then immediately revise the ESEA Title I allocation formula 
to embody the principles of alternative or double-counting? If such 
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cli'^ t ibution formulas called for higher levels of funding than the Pres- 
ident and Congress can envision | then an absorption rule — making the states 
responsible for the compensatory education of a certain proportion of eli- 
gible students — could be established so as to adjust total state entitle- 
ments to the amount of available funding. Such adjustments would preserve 
the Treasury while simultaneously declaring both poverty and low academic 
performance to be national problems. 

Perhaps surprisingly, we would caution against such a Title I formula 
revision at this time. Our caveat is extended not because of any abstract 
distaste for the principles Involved. Rather, we believe that the present 
state of the art would make the d'.stribution of Title I funds based on 
testing unreliable and expensive. Moreover , there is some risk that test 
score payments will establish performance disincentives. (By contrast, 
poverty measures appear relatively unobtrusive. In short, there are at 
present too many practical problems with using testing for allocation 
purposes. 

What then to do with this idea that is good in the abstract, but 
impractical? We suggest an "experiment," specifically, an effort to 
assess systematically the consequences of distributing school aid based 
on pupil performance measures. Such a L^tudy need not be inordinately 
expensive. As mentioned in the Introduction, the state of Michigan cur- 
rently has such a formula funding provision. Its effects could be care- 
fully studied and the results used to inform federal officials as to 
whether or not the practical problems of testing could be overcome for 
ESEA Title I distribution purposes.* 

Recently, two evaluations of Michigan's compensatory education program 
have appeared. See Ernest R. House, Wendell Rivers, and Daniel L. 
Stufflebeam, "An Assessment of the Michigan Accountability System," 
PV>7 Delta Kappan , June, 1974, pp. 663-669; and Jerome T. Murphy and 
David K. Cohen, ^Accountability in Education — the Michigan Experience," 
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Beyond the desire to incorporate pupil performance measures into a 
funding formula, we believe that an additional step forward can be taken. 
On balance, the Orshansky index strikes us as being a more sophisticated 
measure of poverty. It does not compensate for all the weaknesses of 
other poverty enumerators, but it is a relatively objective metric and 
can be improved still more. Conversely, income levels by themselves are 
overly simple, and AFDC arrangements show only the slowest promise of ever 
being standardized among and within states. Consequently, Congress 
should resist any further efforts to drop the Orshansky index, unless it 
is clear that there is a better measure of poverty. 

As with the public performance measures, a caveat must be extended 
with the Orshansky index recommendation. By itself, this poverty measure 
will decrease the relative funding of cities and some categories of mi- 
nority groups. Consequently, when incorporated into the Title I formula, 
it should be accompanied by a ' hold harmless*' or concentration provision 
for three or four years of transition. 

Aside from their potential educational disadvantages, the incorpora- 
tion of the Orshansky index and pupil performance measures as allocation 
criteria would probably attract a wider, more popular base of political 
support for Title I. During the 1960s, this hardly seemed a problem. 
However, events of the early 1970s have demonstrated that shifts in 
Administration policy and economic instability argue for well buttressed 
extensive electoral support for federal education programs. 



The Public Interest, Summer, 1974, pp. 53-81. However, these studies 
do not assess the Michigan strategy with sufficient intensity to answer 
the questions posed in this effort. 



11-95 



REFERENCES 



1. New York State Commission to Study the Costy Quality , and Financing 
of Elementary and Secondary Education ^ Vol. I, Chapter 3, Viking 
Press, New York. 

2. Charles S. Benson et al., Report to the Senate Select Committee on 
School District Finance , California State Senate, Sacramento (1972). 

3. George Madaus and Richard F. Elmore, '^Allocation of Federal Compensa- 
tory Education Funds on the Basis of Pupil Achievement Test Perfor- 
mance/* Statement submitted to the General Education Subcommittee 

of the Committee on Education and Labor, U.S. House of Representatives 
(June 26, 1973). 

4. State Educational Assessment Programs, 1973 Revision , published by 
the Educational Testing Service in collaboration with the Education 
Commission of The States, 



ERIC 



11-97 



Appendix A 

WEAKNESSES OF CENSUS COUNT AND AFDC DATA FOR USE 
IN ESEA TITI£ I DISTRIBUTION FORMULAS 



ERIC 



Appendix A 



\VEAKNESSES OF CENSUS COUNT AND AFDC DATA FOR USE 
IN ESEA TITm I DISTRIBUTION FORMULAS 

In any discussion of methods of distributing funds by various cri- 
teria, one must assess not only the goals of the distribution itself, 
but also the validity of the measures used to determine eligibility. 
This appendix is such an assessment. First we explain the relationship 
between Census, AFDC counts, and the apportionment of ESEA Title I funds; 
then we examine those criteria in isolation to detcrniine their relative 
strengths and weaknesses. Finally, we re-link the Census and AFDC counts 
with Title I apportionments to assess the biases that those counts might 
introduce into the system of ESEA grants. 

The first issue to be dealt with, then, is the bases for apportion- 
ing Title I money. According to the governing regulations, the size of 
the grant to a local educational agency is determined by applying a dis- 
tribution formula to : 

T.>? sum of (1) the number of children aged 5 to 
17, inclusive, in families residing in the school dis- 
trict of the local educational agency and having an 
annual income of less than the low-income factor,* « 
[i.e., $2,000], (2) the number of children of those 
ages in families residing in the school district and 
receiving, from payments under the program of aid to 



Regulations, Title I of the Elementary and Secondary Education Act of 
1965, Public Law 89-10. Title 45, Part 116 of the Code of Federal 
Regulations, Revised and Amended. 
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families with dependent children under a State plan 
approved under Title IV of the Social Security Act, 
an annual income in excess of the low-income factor..., 
(3) the number of children of those ages living in the 
school district in institutions (other than institutions 
operated by the United States) for neglected or delin- 
quent children (other than children for whom a State 
agency is directly responsible for providing free public 
education) and (4) the number of children of those ages 
living in foster homes in the school district and being 
supported with public funds..,. (116.3) 

There are biases introduced by each of these four criteria. However, 
since Census count children and those receiving aid to families with de- 
pendent children account for the vast majority of Title I eligible children, 
it is the major flaws of these two measures upon which we shall concen- 
trate. 

The first me thod— family annual income below $2 ,000— Inaccurately 
counts eligible children because it is updated only once every ten years ^ 
by the decennial Census. However, because of the Herculean problems in- 
volved in amassing the voluminous data contained therein, it takes another 
four years before these data reach a form useful for counting purposes by 
other agencies. The result for a program such as ESEA Title I is that 
1973 apportionments are based on 1960 Census figures, which used 1959 
income figures as their base. In 1974, it is true, apportionments will 
use the latest, i.e., 1970, Census data; but these data will already be 
four years out-of-date since they were collected in 1970 on the basis of 
1969 income figures. 

Table A-1 illuminates the magnitude of the change between the 1960 
and 1970 Census counts: a full 46.5 percent drop has been recorded in 
the number of families nationwide with incomes under $2,000, ranging by 
state from a 69 percent decrease in North Carolina to a 23 percent increase 
in Nevada* 
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Table A-1 



CENSUS CHANGES IN NUMBER OF CHILDREN FROM FAMILIES 
WITH ANNUAL INCOME UNDER $2,000 
(Selected States) 



Staie 


1960 Census 


1970 Census- 


Difference 


Alabama 


242,522 


95,984 


-60.4 


Arizona 


38,851 


29,328 


-24.5 


Connecticut 


20 ,731 


22,226 


+7.2 


Illinois 


147,518 


103,789 


-29.6 


Minnesota 


77,280 


31,885 


-58.7 


Nevada 


3,230 


3,964 


+22.7 


New Jersey 


59 , 845 


57,733 


-3.5 


North Carolina 


323,096 


99,224 


-69.2 


Pennsylvania 


175,394 


102,040 


-41.8 


Total U.S. 


4,948,119 


2,645,820 


-46.5 



The problems associated with using an AFDC-need Index are even more 
complex. However, before describing these problems, It Is Important to 
understand the history and functions of the program. 

When enacted In 1935, the Social Security Act Incluied titles estab- 
lishing the following categorical programs: Old Age Assistance, Aid to 
Families with Dependent Children, Aid to the Blind, and the Combined Pro- 
gram of Aid to the Aged, Blind, or Disabled. Originally the Social 
Security Act was not conceived as a comprehensive or Integrated public 
welfare program. Rather Its services were Intended to assist states to 
support only persons unable to work owing to age or blindness, and families 
with no wage earner. 
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This intent in and of itseli. provokes a measurement problem. The 
narrowness of the eligibility definition means that a large proportion 
of the poor population and their dependents, including those who cannot 
find work or who work at very low wages, are denied eligibility and 
therefore relief under AFDC, Such families could not have been counted 
as part of the ESEA Title I entitlement for a district under the low- 
income factor, either, unless they had been so counted in the latest 
decennial Census year and had remained in the same school district. 
Also, if the family had been determined eligible for an AFDC grant but 
the size of that grant was under $2,000, the family would not be in- 
cluded for Title I counting purposes because the definition in the 
Regulations would assume that their income was as low as the last Census 
count and that they had already been included. However, if this were 
the 1973-74 school year, with entitlements based on 1960 data, the 
chances of such an assumption being correct are slim. 

The President's Commission on Income Maintenance Programs estimated 
that over 58 percent of poor children were not covered by AFDC or OASDI. 
Furthermore, of 12,5 million poor children in 1966, a minimal estimate 
of 3,5 million received AFDC, Although there are no precise figures, 
one must logically conclude that a sizable number were likewise not 
counted under the low- income factor because they had not been conceived 
at the time of the last census update. 

In constructing the AFDC program. Congress initially established 
basic eligibility guidelines. An AFDC applicant must: 



President's Commission on Income Maintenance Programs, Background Papers , 
Government Printing Office, Washington, D.C, , p. 279, 1971. 
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•.. demonstrate that the child is deprived of the 
care and support of one parent by death, desertion, in- 
capacity or, in 21 States, unemployment. 

If the cause is desertion, she must agree to report 
the child's father to the District Attorney and, usually 
swear out a warrant for non support. 

In most States she must prove that she has been a 
resident for one year. 

She must show that she has no real property, or 
that it is valued within the prescribed limits. 

She must show that her income is insufficient for 
self-support — i.e., that there is a budget deficiency. 

She must meet whatever special requirements the 
State may impose. 

She must give a ''social study" describing her 
background and history, and make a plan for herself and 
her child to lead toward self-support. 

She must submit to home visits by social workers^ 

She must be prepared to have all statements re- 
ferring to eligibility verified through birth or marriage 
certificates, credit checks, letters to employers, 
insurance companies, banks, etc.* 

Since the program was to be state administered. Congress left to the 
states the task of adopting more precise eligibility definitions. The 
result has been a substantial difference between states regarding family 
composition and assets defined as appropriate for receiving assistance. 
One major difference occurred as recently as 1961 when Congress approved 
an amendment permitting AFDC payments to families in which the father was 
present and unemployed. (The program is called AFDC - Unemployed Parent, 
or AFDC-U). However, only 25 states take advantage of this new program; 
the remaining 25 permit no aid when the father is present in the house- 
hold. (This is known as the ''man-in- the-house** rule, and under it a 
child is deemed not to be "deprived of care and support" if there is a 
man present regardless of his legal requirement to support.) This 



Ibid., p. 273. 
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dichotomy is of particular consequence because states without the AFDC-U 
program are those with the nation's highest unemployment rates. Their 
inclusion under AFDC-U would drastically alter current AFDC expenditure 
patterns. 

Another major variant between states is the residency requirement. 
Under the Social Security Act, states could impose a residency require- 
ment not in excess of one year. Up until 1969, 11 states had no such 
residency requirements and 39 states had a one-year requirement. However, 
the Supreme Court banned residence requirements in 1969 as an infringement 
on the fundamental right to travel. 

Additional between-state variations include the following: 

• A state may pay for children up to age 21 if they 
are in school : 

- 38 States have some such provision 

- 9 States will pay to a maximum age 18 under cer- 
tain circumstances 

- 3 states will pay to maximum age 16, IP., or 20, 
respectively 

• Real property used for a home may be owned 

- with no value specified in 32 states 

- value under $3,000 in 4 states 

- value $3,001 to $7,500 in 8 states 

- value over $7,500 in 6 states 

• Personal property may be retained 

- with value under $500 in 12 states 

- value $500 to $749 in 12 states 

- value $750 to $999 in 10 states 

- value over $1,000 in 11 states 

- miscellaneous maximums by item or other limit 
in 5 states. 



Shapiro v. Thompson , 394 U.S. 618, 89 S. Ct. 1322, 22 L. Ed. 2d 600 
(1969). 
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• Liens ) recoveries or assignments are required in 11 states* 

• Provisions for disregarding certain income above the man- 
datory provisions are utilized in 25 states.'*' 

In some states the "continued absence" of the father is defined as 
over 90 days, while in others a woman need only file for a legal separa- 
tion or a divorce. Illegitimate children are often considered automatically 
deprived and no waiting period is required for receipt of aiu. 

In addition to numerous variations among states , there are some with- 
in states. According to the President's Commission on Income Maintenance 
Programs : 

••. the inequities within States are due to anomalies 
in the definition of unemployment. This definition, usually 
in terms of hours worked, may vary if one is already receiv- 
ing assistance as opposed to one who is applying. A man on 
assistance working 20 hours may continue to be eligible while 
a man applying may be considered employed. In seme cases, 
because of differences in hourly wages, a person working 
20 hours a week may make more than one working 30 and still 
be eligible. A man who works 29 hours may be eligible 
but not if he works 30. A man on the programs [sic] allowed 
to deduct costs of employment and, even though he has no in- 
centive such as the $30 and one-third provision, may still make 
more than his nonassistance counterpart because his cost of 
transportation is reimbursed while the person not on assis- 
tance must absorb his,^ 

Another study maintains that there is as much variance among counties 
in a given state as there is among states. A study conducted by the Social 
Yorkers Union revealed additional variance in the treatment of individuals 
within the same department and same work unit.^ 



President's Commission, op. cit., p, 273. 

t 

Ibid., p. 279. 

* 

American Civil Liberties Union and Social Workers Union 535, Welfare : 
The Question of Unequal Treatment. 
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Aside from the formal eligibility requirements, there are several 
additional disincentives in the AFDC program. In order to receive aid 
under the AFDC program, a woman must submit to a large number of personal 
questions. She must also be willing to swear out a NOI£0~Notice to Law 
Enforcement Officers — warrant accusing her husband of desertion and non- 
support. In Madison, Wisconsin it was reported in 1968 that she must 
submit to a lie detector test to prove her sincerity. If she lives in 
a small rural town, where she is known by everyone and will be branded as 
a welfare recipient, she has further disincentives for application. Fi- 
nally, if she lives in a state such as Missississippi , where the average 
1969 per-person payment was $9.20 per month and the maximum family payment 
was $130 per month — or in any of the other 14 states where 1969 payments 
averaged less than $30 per recipient — she might think twice before further 
alienating her husband by swearing out a NOLEO warrant and further sub- 
mitting herself to the attendant humiliations of welfare receipt. 

Another skewing factor, especially for Southern states such as 
Mjssi:-;^ippi and Arkansas, is tha*: they have large ntimbers of persons 
receiv^) g AFDC grants, but most recipients in Arkansas and all in Missis- 
sippi have incomes below $2,000 per year. As a consequence, although the 
number of people on AFDC drastically increased in the 1960s, only wealthier 
states with higher AFDC payments (i.e., over $2,000 per family yearly) 
received the yearly increments in ESEA moneys. 

The above shows that there are several factors which serve to skew 
the distribution of federal moneys under the ESEA Title I program. De- 
pendence upon decennial Census counts and AFDC grants, the latter in 
particular, build into the ESEA distribution structure the same defects 

that these two individual counts embody. 
I 
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Four methods are currently being considered for providing 
an updated enumeration of eligible children at the county level 
(other than institutional children) * The first three involve 
adjustments to the initial distribution obtained from the 
decennial Census, The fourth is a separate enumeration, in- 
dependr it of the decennial Census : 

(1) Adjust each county's enumeration by the amount of 
year-to-year change in the state total. In this 
method each county's share of the state total re- 
mains fixed between years of the decennial Census, 

(2) Adjust each county's enumeration by the percen- 
tage change in the AFDC count for that county.^ 
Then readjust the data for all counties in a 
state proportionately to conform to the state 
totals derived by other means, 

(3) Adjust each county's enumeration by an increment 
proportional to the increment in the county's 
AFDC count. The adjustment is scaled in size 

in order to insure that the sum of adjusted 
county enumerations within a state add up to 
the known state enumeration, 

(4) Use the most recently available AI'DC count for 
each county, irrespective of Census data. 



John J. Donaldson, Statement to General Subcommittee on liducation ol 
the Committee on Education and Labor, House of Representatives, Wash- 
ington, D,C,, Apr, 3, 1973. Published in Elementary and Secondary 
Education Amendments of 1973, Part 3 and Appendix, p. 2399. 

In this and the methods that follow, the AFDC count is the number ol 
children (ages 5-17 or 0-20, depending on data availability) in families 
receiving AFDC payments in any amount, except those in the optional 
unemployed-parent program. 
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APPENDIX B 
THE ORSHANSKY POVERTY INDEX 

When the Council of Economic Advisors defined a poor family in 1963 
as one v^'ith a yearly income of less than $3,000, it noted that this was 
''a crude and approximate measure/' Subsequently, Mollie Orshansky of the 
Division of Research and Statistics, Social Security Administration, sug- 
gested that family size was a major variable in determining financial need. 
In 1965 she carried her research further to define ''equivalent incomes at 
a poverty level for a large number of different family types."* Special 
schedules were also made for farm and nonfarm fan lies, recognizing that 
farm. families can generally live on less cash income than city dwellers. 

In establishing the poverty index, Ms, Orshansky attempted to derive 
a standard based on the level of income remaining after basic food needs 
were met. She described her standard as "admittedly arbitrary, but not 
unreasonable.^ 

In determining basic food needs, Ms. Orshansky initially used the 
"low-cost" food plan of the U.S. Department of Agriculture. (Welfare 
agencies also typically employ this plan to establish grant levels.) In 
1964 the USDA issued a second plan, known as the "economy" food plan, 
which costs 75 to 80 percent as much as the "low-cost" food plan and is 
a nutritionally adequate food plan designed for "emergency or temporary 
use when funds are low." The Orshansky index most often referred to is 



Mollie Orshansky, "Counting the Poor: Another Look at the Poverty 
Profile," Social Security Bulletin , Vol. 28, No. 1, p. 3 (January 1965) • 

Ibid. , p. 4 • 



11-113 



based on the latter. The low-cost plan assumes an average per-person cost 
of $5.90 per week; the economy plan assumes $4.60 per week per person. 

The food costs for each family size were determined according to the 
economy plan, at January 1964 prices, as a point of departure for estimat- 
ing a base. If a family of three or more had an income that was less than 
three times that base, it was defined as poor. For families with two 
members, the ratio of food to income was set at 27 percent to allow for 
the relatively higher fixed costs of operating a small household. To 
account for all possible family combinations, several income points wore 
developed for families at each size, but with different combinations of 
chilren and adults, and based on the sex of the head of the household. 
Then they were weighted together in accordance with the number of families 
of each type within the population to establish a basic set of income 
and expenditure figures. 

Since farm families can usually count most of their household ex- 
penses and a percentage of their food expenditures as part of their agri- 
cultural operation, the Orshansky index assumed that they would need from 
30 to 40 percent less cash income in order to maintain a living standard 
equal to that of a city family of similar composition. 

In sum, the Orshansky index separated farm and nonfarm families and 
tor each separately classified (1) unrelated indi\/iduals by age and sex, 
and families by sex of the household head, (2) total number of family 
members, and (3) number of children under age 18. The income was then 
compared with the appropriate minimum income schedule. 

Orshansky herself has noted that while the Council of Economic 
Advisors (CEA) used $3,000 as the poverty standard for a family of two 
or more persons and $1,500 as the minimum standard for an individual, her 
standard pivoted (in 1965) around $3,130 for an average family of four 
persons and $1,540 for an unrelated individual. Although she admitted 
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that her standard was not ''materially different" from the earlier one and 
that the number of persons defined as poor was approximately the same 
(29.75 million compared with 28.50 million under the CEA definition), she 
noted that the major difference was in the types of persons and families 
counted. Also, though the number of poor individuals increased, the num- 
ber of poor families decreased from 8.8 million in 1963 under CEA stan- 
dards to 7.2 million by the Orshansky index. Fewer small families and 
more large families, with a higher number of children, would be counted* 
Farm families would also have a significantly different (and lower) repre- 
sentation in the index. Thus, while the previous standard represented 
the farm family and the aged as poor, the new Orshansky index portrayed 
the typical poor as young nonfarm families with several children. By 
either definition there wore approximately two million nonwhite ^'poor 
units/* but since the total number of families counted as poor was smaller, 
nonwhite families made a larger percentage of the total. The profile of 
single individuals rema^ nod. approximately the same under either count 
since the income standard was approximately the same and few single indi- 
viduals live on farms. 

Orshansky noted that although the food-income index might be ques- 
tioned by some, it at least provided an interim guide for measuring living 
standards of families in different circumstances. She realized that city 
dwellers were at a disadvantage to those in small towns and that income- 
consumption patterns needed further clarification. 

Subsequently the Bureau of the Census and the Social Security Admin- 
istration officially adopted what became known as the '^Orshansky" poverty 
index. In 1969 a Federal Interagency Committee studied the index and 
recommomled two modifications which were subsequently adopted: '*!) that 
Socl/il Security Administration thresholds for nonfarm families bo retained 
for the base year 1963, but that annual adjustments in levels bo based on 
changes in the Consumer Price Index (CPI) rather than on changes in the 
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cost of food included in the economy food plan; and 2) that the farm 
thresholds be raised from 70 to 85 percent of the corresponding non-farm 
levels. 

The first change was raado following release of a study that compared 
the changes in the CPI with the cost of the economy food plan. As 
Table B-1 suggests, general price increases were not uniformly matched by 



Table B-1 

COMPARISON OF CHANGES IN CONSUMER PRICE INDEX 



AND IN 


THE COST OF 


ECONOMY FOOD 


PLAN: 1959-1968 




Consumer 






Price 


Index 




Year 


All Items 


Food 


Economy Food Plan 


1968 


113.6 


113. 5 


108.7 


1967 


109.0 


109.6 


106.5 


1966 


106.0 


108.7 


106.5 


1365 


103.0 


103.5 


102.2 


1964 


101.3 


101.2 


100.0 


1963 


100.0 


100.0 


100.0 


1962 


98.8 


98.6 


97.8 


1961 


97.7 


97.6 


97.8 


1960 


96.6 


96.5 


100.0 


1959 


95.1 


95.4 


97.8 



Source: U.S. Bureau of the Census, Current 

Population Repor ts, "Special Studies — 
Revision in Poverty Statistics, 1959 to 
1968," Series P-23, No. 28 (August 12, 
1969). 



U.S. Bureau of the Census, Current Population Reports , P-60, No. 86, 
"Characteristics of the Low-Income Population, 1971," p. 17 (1972). 
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food increases. Between the years 1959 and 1966 the CPI increased by 
13t7 percent while the poverty threshold incre^tsed by only 7,9 percent. 
The Review Committee also weighed the available evidence relating to farm 
families and concluded that 85 percent more clearly represented the dif- 
ferential between farm and non-farm families. (Table B-2 gives a sample 
breakdown for different family types following the Federal Interagency 
Committee's changes,) 



Table B-2 



WEIGHTED AVERAGE THRESHOLDS AT THE LOW- INCOME LEVEL IN 1971 



BY SIZE OF 


FAMILY 


AND SEX 


OF HEAD, 


BY FARM 


-NONF ARM 


RESIDENCE 








Nonf arm 






farm 










Male 


Female 




Male 


Female 


Size of Family 


Total 


Total 


Head* 


Head* 


Total 


Head* 


Head* 


All unrelated 
















individuals 


$2,033 


$2,040 


$2,136 


$1,978 


$1,727 


$1,783 


$1,669 


Under 65 yr 


2,093 


2,098 


2,181 


2,017 


1,805 


1,853 


1,715 


65 and over 


1 , 931 


1,940 


1,959 


1,934 


1,652 


1,666 


1,643 


All families 


3,700 


3,724 


3,764 


3,428 


3,235 


3,242 


3,079 


2 persons 


2,612 


2,633 


2,641 


2,581 


2,219 


2,224 


2,130 


Head under 
















65 


2,699 


2,716 


2,731 


2,635 


2,317 


2,322 


2,195 


Head 65 
















and over 


2,424 


2,448 


2,450 


2,437 


2,082 


2,081 


2,089 


3 persons 


3,207 


3,229 


3,246 


3,127 


2,745 


2,749 


2,627 


4 persons 


4,113 


4,137 


4,139 


4,116 


3,527 


3,528 


3,513 


5 persons 


4,845 


4,880 


4,884 


4,837 


4,159 


4,159 


4,148 


6 persons 


5,441 


5,489 


5,492 


5,460 


4,688 


4,689 


4,656 


7 or moi e 
















persons 


6,678 


6,751 


6,771 


6,583 


5,736 


5,749 


5,510 



For unrelated individuals, sex of the individual. 

Source: U.S, Bureau of the Census, Current Population Reports , P-60, 

No. 86, ^^Characteristics of the Low-Income Population, 1971,** 
p. 18 (1972). 
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Subsequent studies have charged that several flaws remain in the 
Orshansky index. The President's Commission on Income Maintenance Pro- 
grams criticized the index as inadequate on the following grounds: (1) 
the Orshansky poverty index does not provide a nutritionally adequate 
diet because the Economy Food Plan was developed for ''temporary or emer- 
gency use when funds are low/' and (2) "the meal-planning, cooking, and 
shopping qualifications needed to reproduce the diet in an ordinary house- 
hold are not characteristic of most low-income housewives."* The Commis- 
sion also asserted that the one-third factor, whereby a family is not 
defined poor unless its income is less than three times its necessary 
food expenditures under the EFP, is in error. The one-third figure was 
based on the expenditure patterns of low income households of all sizes 
and therefore the level of expenditures resulting from this "multiplier 
method" may be too small to cover all the necessities. 

As a replacement model, the President's Commission suggests use of 
a form of "The City Worker's Family Budget" developed by the Bureau of 
Labor Statistics for a number of areas in the United States. The Commis- 
sion notes that although the City Worker's Budget is at a level too high 
for defining poverty ($8,000 to $9,000 for a family of four persons), it 
is an example of a refined use of the budget technique developed by the 
Social Security Administration because it costs out all the components 
necessary for an adequate living standard. 

Another issue singled out by the President's Commission is what it 
calls "anomalous features" of the index. For example, the Orshansky index 
is lower for females than for males, presumably because single Temales eat 
less than single males, "if single women spend more for clothes than 
single males, the differences in poverty lines make no sense. Furthermore, 



President ' s Commission, op. cit. , p. 10. 
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the progression of weighted indices as lamily size increases bears little 
relation to anything known about family economics of scale/'* 

By early 1973, a new government task force had been established to 
redefine poverty in order ''to come up with a new, less politically loaded 
term to describe the nation's poor/'t According to the Washington Rosi , 
the task force began with the assumption that the current count of 25 mil- 
lion poor under the Orshansky index exaggerated the seriousness of the 
problem because the current definition included only cash payments for 
determining income — and omitted noncash benefits such as lood stamps or 
Medicare. A GAO report released by the Joint Economic Committee studied 
the impact of noncash benefits and concluded that their impact is "far 
greater and more complicated than most analysts had supposed." Included 
in these payments were "$2.1 billion in food stamp bonuses; $270 million 
in free food commodities; $65 million in child nutrition programs, and 
$1.9 billion in subsidized housing."''' 

In addition to such "under-measurcments, " the GAO ^tudy noted that 
while the current standard of $4,137 for a nonfarm family of four might 
be appropriate for a family in a small town, it would be insufficient lor 
a family in a big city such as New York or Chicago. Furthermore, the GAO 
emphasized that it would stress neutral terms such as "low income" rather 
than "poverty" in defining minimum income levels;. 



Ibid., p. 29. 

William Chapman, "Definition of Poverty Studied," Washington Post, 
p. A-3, (March 31, 1973). "gAO Study of Poverty Discussed — Ed.", 
Washington Post, p. A-18, (April 12, 1973). 
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Appendix C 
SIMULATION METHODOLOGY 



Intirastate Analyses 

Present Title I Distribution Criteria 

/ 

' To project the consequences of Title I dollar distribution changes 
one must first understand the complicated structure under which funds 
were allocated until 1974. The first step in this process was, within 
each state, to determine the number of children in each county who qual 
ified as "Title I Formula Children," Such children were "counted" in 
the following ways: 

(1) They come from families whose annual incomes are under 
$2,000 as determined by the 1970 Census (for FY 1974 
allocations) or (for allocations through FY 1973) by 
the I960 Census. 

(2) These "Census Children" are members of families who have 
annual incomes in excess of $2^000 but who are neverthe- 
less sufficiently poor to qualify for state aid under 
Aid t(^ Families with Dependent Children.* 

(3) These "AFDC Children" are under state authority because 
they are neglected or delinquent, 

(4) Children are residing in state supervised foster homes. 

There are also categories for Title I funding for migrant children 
and numerous other refinements further complicated che financial 



if 

In some states where AFCD allocations are extraordinarily liberal, ex- 
tending upward through substantially higher income classifications, a 
segment of AFDC children are included who come from families whose 
income is above the $2,000 level but who are among the "poorest" of 
the AFDC classifications. 
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distribution of Title I funds.* In the main, however, the four categories 
above sufficed for an understanding of the changes in distributional cri- 
teria that would occur under the proposed switch to achievement test 
scores as a measure of disadvantage. 

The number of children in each category designated by the county 
as Title I Formula Children is then transmitted to the State Education 
Department, which sums these data and transmits the results to the U.S. 
Office of Education, Dollar amounts are then computed by USOE for each 
state and for each county within each state. 

If a state's mean per pupil current operating expenditure is less 
than the national average, the federal government uses the national average 
pcr-pupil expenditure as a multiplier to determine the state's Title I 
grant. If a state is spending more, USOE uses the state's own average 
per-pupil expenditure when. computing the Title I grant. Each state re- 
ceives a dollar grant equal to the product of its total number of Title I 
Formula Children multiplied by the appropriate average por-pupil expendi- 
ture. County entitlements are determined in the same way: the appropri- 
ate (state or national) average per-pupll expenditure is multiplied by 
the total number of Title I Formula Children in the county. The resulting 
dollar amounts are termed the "Maximum Authorized Grant^* for each political 
jurisdiction. However, since almost all schools ar(^ operated by individual 
districts, district Title I allocations must be calculated. 

It is difficult to determine the number of neglected, delinquent, 
and foster children residing in a school district, because these children 
aru supervised by county agencies. Therefore, in most states, district 
allocations exclude children in these categories. Numbers of Census and 
AFDC children are determined for each district in a county. Then the sum 

For example, special incentive grants, which are related to the ' national 
effort index," and the "state effort index.'* 



11-124 



is found of all such children living in the county. After deducting ex- 
penses for Title I Formula Children attending county-administered schools, 
remaining dollars in the county Title I Maximum Authorized Grant are di- 
vided by the total number of Census and AFDC children residing in all 
districts of the county. This results in a figure which represents the 
county's dollar amount for each Title I Formula Child attending school in 
a district. This per-pupil dollar figure is subsequently multiplied by 
the number of AFDC and Census children residing in each district to deter- 
mine the number of dollars provided to each district for its Title I pro- 
grams. This is the district's Maximum Authorized Grant, 

In order to fund the Title I program fully, Congress would have to 
authorize the number of actual dollars determined by those calculations. 
To date, however, the legislative branch has declined to provide funding 
at this level. Therefore, after Maximum Authorized Grants are determined, 
each of them is multiplied by the fractional part of the total funding 
provided by the Congressional appropriation in order to identify the 
actual number of dollars a disti'ict will receive. If, as in FY 1973, the 
Maximum Authorized Grant totalled $4,038 billion, and only $1,316 billion 
was appropriated by Congress, then each district, county, and state ;n 
the United States actually received only 1316/4038 (or approximats. V 
32,5 percent) of its Maximum Authorized Grant, This final dollar figure 
is termed the "Ratably Reduced Amount," 

This study attempts to circumvent *he less than full funding problem 
determining changes in the number of eligible children, rather than 



The total number of Title I Formula Children is larger than the number 
of AFDC and Census children, as has been described. However, the dif- 
ference is small: in California, the total number of Title I Formula 
Children in FY 1973 was 796,690 and of AFDC and Census Children, 767,565. 
The AFDC and Census figure is 96.34 percent of the total number of Title I 
Formula Children. 
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precise dollar amounts, under the various allocation criteria. Where we 
do project changes in dollar amounts under the present and the proposed 
schemes, we use Ratably Reduced Amount figures as the base. 

What Test ? 

For the within-state allocation simulations, the criteria used were 
reading achievement tests administered to all children of one grade at 
one testing period. Each of these tests or modifications of them had been 
included in the Anchor test standardization effort (discussed later in 
Interstate Analyses). 

Beyond selection of the test instrument, a determination was made 
as to which children in each state our study would use. Normally, state 
testing programs include children in several grades. Since information 
for the Anchor Test Study (ATS) used children tested in the 4th, 5th, and 
6th grades, our simulation also chose reading tests given to children in 
those grades. In order to qualify for inclusion in the within-*state an- 
alysis, a state must have tested all children in its 4th, 5th, or 6th 
grade, (The term "all children" means that all children in attendance 
at the school the day the test was given were tested, except for special 
categories of children such as educationally handicapped or bilingual.) 

Defining Disadvantaged 

The last major decision to be made for our intrastate analyses was 
the definition of the term "educationally disadvantaged." As described 
above, educationally disadvantaged under the present Title I formulation 
means a child whose parents are poor. Many educators disagree with that 
definition, however, pointing to the imperfect correlation between poverty 
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and achievement. However, there is little consensus on what proportion 
of the children should be termed "educationally disadvantaged'* in the 
event that test scores are used as the criterion. An arbitrary cutoff 
point must be selected, and all children scoring below must be classified 
as ''educationally disadvantaged." The selection of this cutoff point is 
likely to be the focus of sizable and recurring argument. Therefore, •>ur 
analyses attempt to provide answers for persons holding different points 
of view. 

A nalysis #1 

We assumed initially that the number of educationally disadvr/a- 
taged under the original poverty criterion is the number of children we 
wish to label educationally disadvantaged under a newly inaugurated test 
score criterion. The percentage of children in each state who qualify 
today as Title I Formula Children is found by dividing the total number 
of Title I Formula Children by the total state enrollment. This percent- 
age is the one we desire to include under our new formulation. Therefore, 
the percentile cutoff point on the criterion test is selected to coincide 
with the percentage of Title I Formula Children now living in the state. 

After converting the percentile cutoff point to a raw score, 
the number of children in each district who score below the cutoff point 
is calculated and the results summed to give the total number of students 
who scored below the percentile cutoff point in the entire state. Divid- 
ing the number of children scoring below the cutoff point in each district 
by the total number in that classification in the Ktate yields the propor- 
tion of children in this classification for the entire state who fall 
into each district. 



For added discussion of this point, see John A. Emrick, "An Analysis of 
Proposals to Target ESEA Title I Support Based on Test Score Criteria," 
rtanford Research Institute, Menlo Park, California, (October 1973). 
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The same procedure i': used to identify the proportion of Title 
allocations now granted to each district: the number of AFDC and Census 
children living in each district is divided by the toti.1 number of AFDC 
and Census children for all districts. This number represents the dis- 
trict's share of Title I Formula Children. 

Now that the district's share of the current number AFDC and 
Census children and its share of the number of low-scoring children have 
been determined, the larger of the two numbers is subtracted from the 
smaller number. The difference between them, when divided by the dis- 
trict's share of AFDC and Census children, represents the change result- 
ing from a switch to test score criteria, if the number of educationally 
disadvantaged children is the same as the proportion of Title I Formula 
Children currently in the state. This number, when multiplied by 100, 
reveals the percentage change in the district's share of children (and 
funds). Districts are then grouped by the magnitude of the percent chang 
they would experience under the proposed new criterion. 

Analysis ^2 

The foregoing analyses reveal shifts that would occur within a 
state if it were desired to set the number of educationally disadvantaged 
children as closely as possible to the current number of children classic- 
fied as poor. However, those wishing to use test score criteria as the 
definition of educationally disadvantaged differ about the proportion of 



It must be remembered at this point that this traction does not repre- 
sent precisely each district's share of Title I Funds, because county 
allocations are based upon the number of neglected, delinquent, and 
foster children as well. As discussed above, the number of dollars 
allocated to each district for each of its AFDC and Census children 
differs by county. Therefore, our fractional share for each district 
is only an approximation, albeit an approximation remarkably close to 
the actual fractional share of Title I dollars. 
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children who are currently in need of compensatory education. For this 
reason, an analysis is also performed for test score percentile cutoff 
points of 10, 15, 20, 25, 30, and 35. This displays the manner in which 
the distribution changes when ever- increasing levels of test performance 
are chosen as criteria for defining the educationally disadvantaged. 

Providing a district-by-district analysis of these possibilities 
results in a cumbersome array of numbers (for California, over 8,000); 
therefore we present comparison data in the following fashion. 

Districts have been sorted into categories according to the 
proportion of AFDC or Cens'ic children in their enrollment (0-5, 5+ - 10, 
10+ - 15, 15+ - 20, 20+ - 25 percent, and so on). The total number of 
children scoring in each of the test score percentile cutoff distributions 
was found for each of the percentile groupings. Summing the numbers for 
each of the groupings reveals the number of students in the entire state 
scoring at this level. Determination of the proportion of the total 
state AFDC and Census Childr:^n falling into each grouping was computed 
in the manner used in the earlier analysis. 

Finally, subtracting the new portion of students in each test 
score percentile cutoff point from the proportion of AFDC and Census 
Children falling into each of the groupings, dividing by the proportion 
of AFDC and Census Children falling into each grouping, and then multi- 
plying the resulting fraction by 100 yields the percent change in the 
proportion of the state's children (and of state funds) flowing to each 
district under the particular criterion cutoff point. 

Note 

A special technique was introduced to resolve the problem of 
reallocation of children in secondary (high school) districts. Since the 
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test score criteria is based on tests given to 4th, 5th, or 6th graders, 
no district scores a; n available in these secondary districts. For states 
containing some secondary, some elementary, and some unified (K-12) dis- 
tricts, a special technique is needed to resolve the problem of realloca- 
tion of children via test score criteria in secondary districts. For 
states with all three types of districts, it is necessary to "nest" the 
elementary districts into the secondary districts. In other words, each 
secondary district included its own enrollment, its own Title I Formula 
Children figure, and the sum of the test scores in each percentile cutoff 
point for all of its "feeder" elementary districts. 

There were other problems. Despite statutes requiring each 
district test the required grade, some school districts report no test 
results. In California, 36 district:^ reported no 1973 test score figure:>. 
These districts — whose enrollment totalled 33,883 and AFDC and Census 
counts totalled 5,138 — were not included in the analysis* Similarly 
several large districts in Iowa could not be included because test re- 
sults were unavailable. There were a few other school districts in 
several states that could not be included for other statistical reasons. 

Interstate Analyses 
The Anchor Test 

If test scores were used as a distribution criterion instead of 
poverty levels, would some states receive a greater proportion of the 
total national Title I appropriation and some a smaller proportion than 
is now the case? The answer to this question requires a suitable instru- 
ment to compare the results of statewide testing programs. Publishers' 
norms cannot themselves be used for this purpose owing to variations in 
the sample upon which such norms are based. The problem is complicated, 
particularly in the instance of reading ability tests, by the large 
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variety and number of standardized tests in use by sstates. Fortunately, 
however^ the problem of noncomparability has been resolvedi at least par<- 
tially, through the Anchor Test Study (ATS). In April 1972, the Educa- 
tional Testing Service collected data on seven reading tests over a 
meticulously selected national sample of 4th| Sth, and 6th graders. The 
individual tests, together with their specific forms and levels, are pre- 
sented in Table C-1.* 

For all its usefulness, the Anchor Test Study was not able to solve 
the noncomparability problem completely. For example, the Anchor Study 
was conducted in the spring, and corresponding results for fall and win- 
ter (in the case of fall, winter, spring tables) or for fall (in the case 
of fall, spring tables) are not available. 

Furthermore, states frequently did not use the same test form as 
employed by the Anchor standardization effort. Not all states that have 
statewide testing conduct such tests in the 4th, 5th, or 6th grade. The 
problem is further compounded by the fact that there are several states 
(for example, New York and Delaware) which have highly developed testing 
programs, but employ their own tests. Such tests, of course, are normed 
on standards for the particular state in question and cannot be used in 
an interstate comparison. For these and other more technical reasons, 
our interstate simulations are not flawless and must be interpreted in 
light of such limitations. 



To include as many states in the interstate analysis as possible, two 
diviations from our practice of using 4th, 5th, and 6th grade test 
results were permitted. First, the state of Arizona, which tests its 
3rd grade children using the Metropolitan Achievement Test, and the 
State of Arkansas, which administers the SKA Achievement to its 8th 
graders, were included. Second, several states (New Hampshire, Arkan 
sas, and Mississippi) which tested only a portion of the children in 
the criterion grade were included in the interstate analysis. These 
last-mentioned states do not select test districts in a systematic 
manner. 
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A nalytic Assumptions 

Two major assumptions undergird our interstate analyses. The first 
is that, in those instances in which a state's testing programs utilized 
a sample of students, the scores from such an endeavor can be generalized 
to all students in that grade level throughout the state. The second 
assumption is that the proportions or distribution of student test scores 
in the grade or grades tested held for all grades (1st through 12th) in 
the state. 

Our use of the Anchor Test Study necessitated a further set of as- 
sumptions. There are differences between the ATS percentiles of 10, 15, 
20, 25, 30 and 35 and the publisher's equivalent percentiles. For example 
it might be that for spring testing of 6th graders, the ATS 10th percentil 
corresponds to the publishers' 9th. It is necessary to assume that the 
difference between the ATS and the publishers' norms for the level, grade, 
form, and season tested in each state is the same as the difference would 
be if the norming had been done by the ATS using the same level, form, 
grade, and season the state used. (This assumption does not, of course, 
need to be made where the grade, level, form and calendar season are the 
same for the ATS and the state; but this was true only in the cases of 
Mississippi and New Mexico.) In cases where the norming was imperfectly 
done, such an assumption is of heroic proportions. It was hoped that it 
would be possible to exclude states that did not fit the precise criteria 
of the Anchor Test Study, However, it eventually became obvious that such 
a rigorous stance would eliminate the possibility of conducting simulation 

Comparative Procedures 

Title I disbursements are based on **old** data. Normally, during the 
winter of the year preceding allocation, a census is tak^n in each state, 
county, and school district, of the number of children who are Title I 
eligible. Our simulations make comparisons between test scores derived 
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during the year students are counted for Title I (i.e«, the year preceding 
actual dollar allocations). In FY 1974^ 6^247^105 children nationwide 
qualified as Title I Formula Children^ according to data supplied by the 
USOE Title I Washington office. In the school year 1972-73, there were 
42,277,382 children enrolled in U.S. school districts.* Thus, for 1972- 
73, the number of children qualifying as Title I eligible is almost 15 
percent of the total U.S. enrollment.^ 

If, under any statutory change to test score allocations, the same 
number of children were to be included in the Title I program as is now 
the case, the proper cutoff point is the Anchor Test 15th percentile. 
However, it is also desirable to know whether or not states would pain 
or lose Title I funds if the cutoff point were set at other test score 
levels throughout the low-achieving range. Thus, the ATS 10th, 20th, 25th, 
30th, and 35th percentile cutoff points were also used for simulation pur- 
poses. 

It was necessary to employ several methods to determine the number 
of children in each of the target states scoring below each percentile 
cutoff point. Detailed descriptions of these procedures are included in 
Appendix D. Basicallyi however, the ra.; score for each state test percen- 
tile cutoff point (corresponding to the ATS percentiles of 10, 15, 20, 25. 
30, and 35) was determined. Then, from a frequency distribution of state 
test scores, we computed the proportion of children scoring below the ATS 



NEA Estimates of School Statistics, p. 27 (1973). 

It is interesting to note that the precise percentage, 14.528, consti- 
tutes a decline from FY 1973, in which the percentage of Title I eligible 
students was 15.669. This decline represents (1) a shift caused by 
changing to the 1970 Census data for allocation purposes, instead of the 
antiquated 1960 data; and (2) the obsolescence of the 1965-enacted 
$2,000 annual income criterion as an index of today's poverty. 
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percentile cutuff. This proportion was multiplied by the 1972-73 ADA for 
the state as given by the NEA Estimates of School Statistics, 

Finally^ we assumed in each case that the national total number of 
Title I dollars remained constant, and thus the funds would shift among 
states. 

Analyses of Alternative Poverty Criteria 

The methodology used in these analyses was identical to that in the 
Intrastate Analyses. Figures were obtained for the number of children 
from families classified as poor by the Orshansky definition, and for the 
number of children from families with 1970 Census income levels below 
$3,000, $4,000, $5,000 and $6,000. These figures were substituted for 
the Title I figures in the previous analyses in order to project funding 
shifts under the chosen indices of poverty. 

There were two weaknesses with the data file utilized. Only dis- 
tricts with enrollments of 300 or more were included. Furthermore, the 
300 figure was not firm, and in some instances the erTollments were larger 
or smaller. To circumvent this difficulty, it was decided in districts 
where no alternative poverty numbers were available, to multiply the 
Title I eligibility figure by the national proportion of alternative 
poverty children. For example, since there were 6,247,105 FY '74 Title I 
children in the United States and 7,700,368 children defined as "poor'* 
under the Orshansky formula, the final Orshanksy figure was obtained by 
multiplying the Title I figure by 1.2326 to take into account districts 
where no Orshanksy figure was available. 

In addition, the number of Orshansky children listed is the number 
between ages 5 and 17 who live within the boundaries of the district. 
Tl/as, where there were both elementary and secondary districts, as in 
California, students were counted twice. This problem was resolved by 
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multiplying the figure for all elementary districts by 0.6017, the pro- 
portion of elementary children enrolled in California school districts, 
and by 0.3983, the proportion of secondary child7;en« 

Six new formulas (see Table C-2) were developed and simulations run 
to determine the effects on six states. These formula simulations differ 
in one important respect from our earlier analyses: they project funding 
on the assumption that more dollars will be forthcoming and that the in- 
crease in funding will be proportional to the number of children added 
to the present Title I Formula Children, In other words, our simulations 
project the percent increase in eligible children. It is assumed that 
this increase will correspond with the percentage increase in funding. 

Double-Count Simulations 

In Simulations V and VI (which are double-counts of Simulations I 
and II), the number of children is defined as the total of estimated 
academically deficient children plus the poverty children. In Double- 
Count Simulation V, I (Simulation V) poverty children reflect present 
Title I counting methods and in double -count Simulation VI, II (Simula- 
tion VI) they are defined according to the Orshansky index. 

Several technical decisions had to be made to deal with the data on 
individual states. The most burdensome are presented below. 

Iowa 

Figures for counts of AFDC, Census, and Neglected and Delinquent 
children given us from the state of Iowa differ from those released by the 
U.S. Office of Education. Iowa counts are based on Iowa state income tax 
returns rather than on the 1970 Census. The differences would not pose 
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problems except that the total number of children is startlingly different* 
Here arc the U.S. Office of Education state totals. 



1970 Census 



AFDC 



Other 



Total 



22,459 



27,315 2,937 52,711 



The Iowa state totals are: 



1971 Iowa 
Income Tax 



AFDC Plus 
Foster 



Other 



Total 



63,827 



32,818 



234 



96,879 



In our simulations, the totals for Iowa have less meaning since the AFDC 
counts are inflated to begin with. Percentage increases related to a 
population of 96,879 are much smaller than they would be if related to 
the population of 52,711 — the number of children upon which funding is 
based. The AFDC/Census count is utilized in simulations I, III, and V. 
Thus the shifts projected for these simulations are artificially small. 



equated to national norms. For this reason it should not, strictly speak- 
ing, be included in these simulations that estimate the number oi ':hildrcn 
residing in the state who score in the bottom 16 percent nationally. How- 
ever, for purposes of illustration it was decided to include Delaware, 
It was assumed that the low-scoring population within the state was 16 
percent of the state, i,e,, the same as the proportion of academically 
deficient children in the national population. 



Delaware 



As previously noted, Delaware utilizes a test that cannot be 
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Massachusetts 

Persistent problems were encountered in the Massachusetts data* 
Most particularly, it was not possible to obtain a frequency distribution 
for the state's 4th grade testing program. Thus the number of children 
scoring in the bottom 16 percent nationally had to be estimated using 
only a mean and standard deviation. The resulting figure, totaling only 
6 percent of the state's children, was so small that it was felt the dis- 
tribution of scores for the state testing program was probably not normal 
in form. Nevertheless, inasmuch as Massachusetts is an important state 
from the point of view of size, we decided to utilize the figures in this 
analysis. 

Simulations I and II 

The first simulation takes things as they are today, utilizing the 
present definitions of poverty stemming from Census and AFDC counts. The 
number of children scoring in the bottom 16 percent of the state reading 
test is estimated for each district. This figure represents the estimated 
number of educationally disadvantaged children in the schools of the state. 
The new ''Title I Formula Children" figure for Simulation I is the larger 
of these two figures. The percent change for the district is figured by 
the formula 

N. - 0. 
C = — ^ X 100 

where 

C = percent change 

= number of new Title I Formula Children in District i 

O. = number of *'old" (i.e., present) Title I Formula Children 
in District i. 

Percent changes for the entire state are calculated the same way. 
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For Simulation II, the procedure is identical except that the **old** 
Title I Formula Children figure is the 1970 Census determination of the 
number of Orshansky index children residing in the district. In other 
words, the simulation measures the shifts from Orshansky index allocations 
to allocations based on the greater of the number of Orshansky and of ed- 
ucationally disadvantaged children residing in the district. These re- 
sults are presented in the same form as for Simulation I. 

Simulations III and IV 

These analyses are identical to Simulations I and II (III correspond- 
ing to I and IV corresponding to II) except that only if the number of 
academically deficient children exceeds the national average of 16 percent 
is the district allowed to choose the larger of the two measures. In 
other words, if a district has 13 percent academically deficient children 
and 10 percent poor children, it cannot use the academically deficient 
number in Simulations III and IV. The changes in the number and percentag 
of eligible children will be correspondingly smaller. 

Simulations II and IV vary slightly in numbers of districts (and 
children) in each category from the earlier Orshansky-test score simula- 
tions. These variations are due to different methods of estimating 
Orshansky populations when no figures were available. The earlier method 
multiplies the AFDC count by the national ratio of Orshansky/Title I 
children. This simulation substitutes state ratios, since it was decided 
they yield a more precise estimate of the number of Orshansky children 
residing in the district. 
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Appendix D 
SIMULATION RESULTS 



Th.; tables of results in this appendix are grouped as shown below. 
Intrastate Shifts in Funding Tables 



By Numbers and Enrollments of Districts 

Alabama 
Delaware 
Hawaii 
Iowa 

Massachusetts 
Michigan 
New Mexico 
New York 

By Type (Urban/Rural) and Ethnic 
Minorities of D istricts 

California 
Alabama 



Interstate Analyses of Achievement 
Test Scores* 

Alabama 

Arkansas 

Arizona 

California 

Delaware 

Hawaii 

Iowa 

Massachusetts 



Notes relating to each state follow each tabl9# 
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D-1 to D-3 

D-4 to D-6 (and note) 

D-7 to D-9 

D-10 to D-12 

D-13 to D-15 

D-16 

D-17 to D-19 (and note) 
(note only) 



D-20 to D-23 
D-24 



D-25 
D-26 
D-27 
D-28 

(note only) 

D-29 

D-30 

(note only) 



Interstate Analyses of Achievement 



Test Scores (Continued) 



Tables 



Mississippi D-31 

New Hampshire D-32 

New Mexico D-33 

North Carolina D-34 

North Dakota D-35 

Rhode Island D-36 

Tennessee D-37 



Interstate Comparisons Using Alternative 
(Orshansky Index or Family Annual Income) 
Poverty Criteria 



Percentage Changes In Number of Children 

Alabama D-38 to D-42 

California D-43 to D-47 

Delaware D<-48 to D-52 

Iowa D-53 to D-57 

New Mexico D-58 to D-62 



Absolute Number of Children 



Alabama 
California 
Delaware 
Iowa 

Massachusetts 
New Mexico 



D-63 to D-67 
D-68 to D-72 
D-73 to D-77 
D-78 to D-82 
D-83 to D-87 
D-88 to D-92 



Percentage of Children 

Alabama 
California 
Delaware 
Iowa 

Massachusetts 
New Mexico 



D-93 to D-97 
D<-98 to D-102 
D-103 to D-107 
D-108 to D-112 
D-113 to D-117 
D-118 to D-122 
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Interstate Comparisons Using Alternative 
(Orshansky Index or Family Annual Income) 
Poverty Criteria (Continued) 



Tables 



Interstate Shifts ^ by Region 



Rocky Mountains D-123 

Great Lakes D-124 

Mideast D-125 

New England D-126 

Pacific D-127 

Plains , D-128 

Southeast D-129 

Southwest D-130 



Double-Count, Simulation I D-131 to D-136 

Uouble-Count , Simulation II D137 to D-142 



Simulations I, II, III, and IV 



Alabama 
California 
Delaware 
Iowa 

Massachusetts 
New Mexico 



D-143 to D-146 
D-147 to D-150 
D-151 to D-154 
D-155 to D-158 
D-159 to D-162 
D-163 to D-166 
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Note on Delaware Simulation 



In a conversation with Dr. William !• Corkle, Supervisor, ESEA. 
Title I Management and Operations , Delaware Department of Public Instruc 
tion, it was revealed that the Delaware figures include only AFDC cjp.nts 
the low-income figures from Census data are not included. 

The AFDC count is based on figures supplied to his office by the 
State Welfare Department which were collected in the autumn of 1971 • 
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Note on New Mexico Simulation 



New Mexico presents some special problems, most of which arise from 
altering decision rules regardin^T allocation levels. New Mexico experi- 
enced (like other states) some sharp changes in the number of Title I 
Formula Children residing in each district when the figures from the 1970 
Census were used instead of the figures from the 1960 Census* Consequently, 
Dr. Gilbert Martinez of the Title I office was forced to make some adjust- 
ments in his funding projections. New Mexico now guarantees each local 
district 90 percent of its previous yearns funding. To finance this 
guarantee, it has stipulated that no district may receive more than 115 
percent of its previous yearns funding: this tends to skew the distribu- 
tion in such a way that the distribution of Title I Formula Children 
appears more ev^n than it actually is. 

The bulge in the New Mexico projections for the category of "30-35%^^ 
poverty children, where instead of the expected negative percent changes 
in funding there appear positive changes, is due to a single large district, 
Gallup. When Dr. Martinez was asked about Gallup — whether in fact there 
might be some circumstance which would produce this large and unaesthetic 
''bulge" — he answered that he could not think of anything, but did say that 
the Gallup funding for this year should have been $662,000, a great in- 
crease over the $542,000 it received last year. Cutting the projected 
figure to 115 percent of the previous yearns figure yields only $623,000, 
which is 6 percent less than it actually qualifies for. This differential 
is not large enough, however, to account for the appearance of the mysterious 
bulge. Thus its source, unfortunately, remains a mystery. 
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Note on New York 



We did not ourselves analyze New York state. However, from another 
study, we can see that the results in that state fit the pattern we de- 
scribe for our sample • 

Under the direction of Lorne H, Woolatt, Associate Commissioner for 
Research and Evaluation, a New York state study was conducted in the 
spring of 1973 comparing the Title I distribution by counties under the 
present system with that which would be the case if the criterion for 
participation was the bottom three stanines (22 percent) in the New York 
Pupil Evaluation Program, The test employed was developed in the state 
of New York itself. The 6th grade was the level from which the criterion 
was projected. Non-English speaking students were not tested and their 
scores were reported as zero, placing them in the bottom stanine. 

The New York study used counties as the analytic unit. It will be 
recalled that there is a disparity between district AFDC/Census counts 
and county enumerations (which include AFDC, Census, neglected, delinquent 
and foster children). In New York state, only 70 percent of the Title I 
Formula Children are AFDC and Census Children who are currently enrolled 
in the state's schools. Thus, only the 70 percent of the eligible county 
students were included in the projections, in order to maintain comparison. 
The New York Study findings are, as stated by Dr. Woollatt: 

....the number of reading disadvantaged is 33 percent 
more than the number of poverty eligibles: in New York 

City the numbers almost coincide; upstate the reading 

disadvantaged are double the number of the poverty 
criterion. 
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As a consequence, redistributing a fixed sum of ESEA 
Title I funds from poverty eligible to reading dis- 
advantaged would reduce New York City by $30,000,000 
and redistribute $30,000,000 in the rest of the State.* 



Letter from Lome H. Woollatt to Willis W. Harman, September 14, 1973. 
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Table D-25 



ALABAMA READING TEST RESULTS 
(All 4th Graders, Spring 1973) 



Probability 

Anchor Total that Student Estimated Number 

Percen- Reading Scored Below of Low Achievers 

tile Raw Score Raw Score (probability X enrollment) 

10 11 .2380 174,524 

15 13 .3342 245,067 

20 14 .3817 279,899 

25 15 .4275 313,484 

30 16 .4701 344,722 

35 17 .5119 375,374 



Alabama utilized the California Achievement Test, Level 3, Form A, 
Reading in its statewide testing program for 4th graders. The conversion 
to national percentiles was done by finding the raw score corresponding 
to the individual Anchor percentiles, and then finding the probability 
that Alabama students scored below this raw score by utilizing the Alabama 
Statewide frequency distribution. This probability was multiplied by the 
Alabama enrollment of 733,296 to obtain the total number of estimated 
academically deficient children in the state of Alabama. 
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Table D-26 

ARKANSAS READING TEST RESULTS 
(42 Percent of 8th Graders, Spring 1973) 



Students Scoring 

Anchor Below Anchor Estimated Number 

Percen- SRA 6th-Grade Percentile of Low Achievers 

tile Percentile Number Percent (percent x enrollment) 

10 9 1,932 11.49% 84,256 

15 14 2,779 16.53 121,214 

20 19 3,682 21.91 160,665 

25 24 4,777 28.42 208,403 

30 30 5,901 35.11 257,460 

35 35 6,708 39.91 292,658 



Arkansas was the only state In the survey that tested as high as the 
8th grade and tested a sample (42 percent of all the 8th grade children, 
from 36 percent of the districts were tested) . It used the SRA Achieve- 
ment Test, Form F, Red, Reading. A weakness exists In converting Anchor 
norms Into the SRA norms at the 8th grade, because children in 8th grade 
do not go to the same schools, and therefore the samples are probably not 
as closely alike as they would be if norms for both the tests were taken 
from the same sample. 

Arkansas* sampling was not random. Only districts that applied to 
participate in the testing program did so. However, funding was provided 
by ESEA Title IV-A and Title III/ so cost was not a factor in a district* 

Educational Testing Service, State Educational Assessment Programs , 
1973 Revision (1973). 
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decision to participate. The number of children tested was 16,805; the 
total enrollment was 733,296, 

Nevertheless, the comparison of Anchor percentiles 10, 15, 20, 25, 
30 and 35 with corresponding SRA Reading norms (Form "Green-E") shows 
that there is a close correspondence (9, 14, 15, 19, 24, 30 and 35). 
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Table D-27 

ARIZONA READING TEST RESULTS 
(All 3rd Graders, Fall 1972) 



Metropolitan 
Percentiles 



10 
15 
20 
25 
30 
35 



Metropolitan 

Standard 
Score, Fall 
3rd -Grade 

44.0 
46.5 
48.5 
50.5 
51.5 
52.5 



Grade 
Equiva- 
lent 

2.0 
2.2 
2.3 
2.4 
2.5 
2.6 



Percent of 

Children 
Taking Test 
Scoring Below 
Percentile 

5.50% 

9.93 
14.47 
16.83 
21.02 
23.56 



Estimated Number 
of Low Achievers 
(porcent X 
enrollment) 

25,708 
46,415 
67,636 
78,667 
98,252 
110,124 



Used instead of Anchor Percentiles (see explanation above). 

The Metropolitan Achievement Test (Primary II h, Reading), 3rd or 
4th graders, presents special problems. Metropolitan was normed using a 
test booklet, instead of a machine-scored answer sheet, for the 4th grade 
and below. The test booklet takes less of the student's time than the 
machine-scored answer sheet. Anchor norms were developed using a machine- 
scoring answer sheet. The difference between the Anchor norms and the 
Metropolitan norms is thus composed of (1) Metropolitan norming errors 
and (2) the differences caused by the two testing procedures. (Of course, 
there are undoubtedly errors in the Anchor norms too; but these are minute 
when compared with other norming procedures and for the sake of this study 
are taken to be the closest known estimator of the true parameter.) Fur- 
therir , the difference is greater in the middle of the distribution 
(around the 50th percentile) and smaller at the ends, probably because 
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the change made more difference to those In the middle and less to faster 
students (because the time factor was less of a constraint for them) and 
to slower students (because they already had all the time they needed to 
work the test.) 

It was decided that the Metropolitan norms represented a truer pic- 
ture of the actual distribution In this case than the Anchor norms, be- 
cause Arizona, In testing Its 3rd graders, used the test booklet rather 
than the computer-scored answer sheet. Therefore, it was a relatively 
simple matter to use the Metropolitan 10, 15, 20, 25, 30 and 35th percen- 
tiles, apply the fall 3rd-grade norms, and see how many of the Arizona 
3rd graders scored at these levels or below. The numbers were divided 
by the number of children taking the test (36, 211) to yield the percent 
of the children scoring below these levels. This percentage figure was 
then multiplied by the Arizona enrollment for 1972-73 of 467,421 to yield 
the projected numbers. 
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Table D-28 



CALIFORNIA READING TEST RESULTS 
(All 6th-Graders, Fall 1972) 



Percent of 

Anchor Publisher's Children Estimated Number 

Percen- Percentile, Scoring Below of Low Achievers 



tile Q-3 Raw Score Raw Score (percent X enrollment) 

10 10 31 ll7o 484,770 

15 15 38 18 793,260 

20 19 42 22 969,540 

25 23 45 26 1,145,820 

30 27 49 31 1,366,170 

35 31 52 36 1,586,520 



California ADA, fall 1972 = 4,407,000 

The California conversion from the Comprehensive Tests of Basic Skills 
Reading Level 2, Form Q was done in the following manner. The compari- 
son between Anchor percentiles 10, 15, 20, 25, 30 and 35 and CTBS percen- 
tiles for 6th grade. Level 3, Form Q, was noted. The corresponding 
percentiles for the CTBS were then written down in the column '^Publisher ' s 
Percentile, Q-3.'' 

Then, ia the volume titled, Percentile Rank Norms Tables and Summary 
of Test Scores for the California State Testing Program , Fall 1972, con- 
version was made from the raw scores to the state percentile ranks in 
Table 6 of that volume — '^Grade 6, Percentile Ranks, California State 
Testing Program, Fall 1972, Pupils Norms" — which yields the percent of 
California children in the 6th grade testing program who scored below 
these raw scores. 
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Delaware 



Delaware's statewide testing system utilizes a test compiled by its 
State Department of Education. It is a variation of the Sequential Tests 
of Educational Progress. However, there is no way of comparing results 
on Delaware's test with results from other states, since the test was not 
included on the Anchor survey. 

Therefore, Delaware is not included in this interstate analysis. 
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Table D-29 



HAWAII READING TEST RESULTS 
Sequential Tests of Educational Progress I, Form AA, 
(All Hawaii 4th Graders, October 1972) 



Percent of 





Equivalent 


Step II 




Students 


Estimated Number 


Anchor 


Step II 


4A 


Step I 


Scoring 


of Low Achievers 


Percen- 


4A 


Raw Score, 


Standard 


Below Stan- 


(percent x 


tile 


Percentile 


Fall 


Score 


dard Score 


enrollment) 


10 


8 


403.4 


226.4 


8.7% 


15,737 


15 


13 


404.4 


227.4 


13 


23,606 


20 


17 


405.2 


228.2 


16 


29,054 


25 


23 


406.4 


229.4 


20 


36,317 


30 


27 


407.25 


230.25 


25 


45,397 


35 


31 


408.25 


231.25 


28 


50,844 



Hawaii's estimated enrollment for 1972-73 is 181,587. 

Inclusion of Hawaii in the interstate analysis required that test 
scores from the 1957 edition of the Sequential Tests of Educational 
Progress I (Form AA) be converted to the new (1969) STEP. At first this 
seemed an impossible task, but accord to Pat Wheeler of the Educational 
Testing Service staff, who participated in developing the new edition, 
the standard scores on the new test were equal to those on the 1957 version 
plus 177. With this conversion in mind, the analysis became possible. 

Hawaii 4th graders took the STEP-4A in October. Therefore, after 
Anchor percentiles had been converted to STEP-II (4A) percentiles, it was 
necessary to determine the standard score corresponding to the STEP per* 
centiles in the fall of the year for 4th graders. Following this, 177 
points were subtracted from the standard scores, and the percentile point 
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in the Hawaii percentile distribution corresponding to this score was 
lound* For the Anchor 10th percentile, a special interpolation was 
required . 

Alter the percentage of Hawaii Ith-graders who had scored below the 
Anchor percentile cutoff points had been determined, this percentage was 
multiplied by the Hawaii enrollment of 181,587 to determine the estimated 
proportion of Hawaii school children scoring below the Anchor percentile 
cutoff points. 



An attempt to determine this point was made using Z-scores and the 
Hawaii standard distribution. However, it produced a percentile 
cutoll point of 14,83, higher than the score for the 15th percentile' 
cutotf point. This shows that the Hawaii distribution is not normal. 
No other way for filling in the missing percentile figure could be 
'!« U'rmin<.'U, except to take 2/3 of the 15th percentile ligurc, sinuv 
10^;; is 2/3 of I5'c* 
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Table D-SO 



IOWA READING TEST RESULTS 
(All 5th Graders, Winter 1972) 



Percent of 
Children 

Iowa Scoring Estimated Number 

Anchor ITBS I 1-5, Publisher's Grade Below of Low Achievers 
Percen- Spring Grade Score Score, Iowa Grade (percent X 
tiles Percentiles Winter Winter Scores enrollment) 

10 5 32 27 1.67% 10,307 

15 11 36 31 4 24,687 

20 15 38 32 5.33 32,896 

25 19 41 35 9.33 57,583 

30 25 44 37 12 74,062 

35 30 46 39 15.35 94,738 



Iowa has a very complete testing program. It tests all children in 
all grades every year. However, inasmuch as testing is done at all times 
during the year, a frequency distribution for each grade in Iowa is more 
difficult and time-consuming to produce. Consequently, the statewide fre- 
quency distribution for the 1972-73 testing was not available at the time 
this analysis was performed. We have used the frequency distribution from 
the 197l-*72 Iowa Tests of Basic Skills, level II, Form 5. Iowa's enroll- 
ment was 617,185. 

The analysis is identical to that performed on the North Dakota 
figures, described later. 
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Massachusetts 



Massachusetts tested its 4th grade children in January 1971, utiliz- 
ing the Comprehensive Tests of Basic Skills, Level 1, Form QW. The State 
Department of Education was unable to provide us with a frequency distri- 
bution for the state as a whole, and owing to the difference in the level 
utilized between the Anchor test and the state testing program (Level 1 
vs» Level 2), it was decided that a simulation was not possible in this 
case* 
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Table D-31 



MISSISSIPPI READING TEST RESULTS 
(85% of Districts' 5th Grade Children in the Spring of 1973) 



Anchor 
Percen- 
tile 



Reading 
Raw Score 
Corresponding 
to Anchor 
Percentiles 



Probability 
that Student 
Scored Below 
Raw Score 



Estimated Number 
of Low Achievers 
(probability x 
enrollment 



10 
15 
20 
25 

30 
35 



29 
33 
37 
40 
43 
15.5 



.3156 
.4090 
.5172 
.5975 
.6520 
.7071 



155,212 
201,146 
254,359 
293,851 
320,651 
3 J7,HP9 



N = 39,067 
X-E(x) 



Z = 



(x) 



Determination of Z-Score 



1. P(X ^ 29) 



(Z ^ 36.7 - 29 \ 
16.116 / 



P(Z ^ 0.4777) 



Interpolation 



.45 
.48 
.50 



.3264 
.3156 
.3085 



.3120 



2. P(X < 33) = P|Z ^ 33-36.7 I 

\ 16.116/ 

.20 = .4207 

= P(Z -- -.2295) or -.23 .23 = .4090 

.25 = .4013 

.4090 
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Determination of Z-Score 



3. P(X s 34) 



4. P(X < 37) 



5. P(X ^ 40) 



6. P(X 5 43) 



P/Z ^34-36 



i 



16.116 



P(Z ^ -.1675) 



.4207 



p/z S 37-36.7 \ 
\ 16.116/ 



P(Z .0434) 



.5172 



(Z < 40-36. 7 \ 
16.116/ 



P(Z < .2047) 



.5975 



p/z <: 43-36. 7 \ 
\ 16.116/ 



P(Z < .3909) 



Interpolation 



.15 
.1675 
.20 



.00 
.0434 
.05 



.20 
.2047 
.25 



.35 
.391 
.40 



.4404 
.4354 
.4207 



. 5000 
.5172 
.5199 



.5793 
.5975 
.5987 



.6368 
.6520 
.6554 



.6520 



7. P(X S 45.5) 



p/z s 45. 5-36. 7 \ 
\ 16.116 / 



P(Z S .5460) 



.50 
.546 
.55 



.6915 
.7074 
.7088 



.7074 
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The Mississippi analysis required a technique different from any 

other state. To begin with, the test, form, and season are all the same 

I 

as for the Anchor standardization (5th grade, California Achievement Test, 
Level 3, Form A, Spring testing), so that differences in publisher's norms 
are eliminated. But from there the similarity ends. Mississippi computes 
only the mean, standard deviation, and the lOth, 25th, 50th, 75th and 90th 
percentiles, and these for Mississippi. Therefore, if Mississippi was to 
be included in the interstate comparison, we had to assume that the dis- 
tribution of test scores was normal, and use Z-scores to determine the 
number of children who probably scored in the Anchor percentiles; of 10, 
15, 20, 25, 30 and 35. 

This assumption of normalcy is somewhat suspect because no distribu- 
tions are perfectly normal, and because the racial makeup of Mississippi 
Implies that the distribution may, in fact, be bimodal rather than normal 
in form. 

Mississippi also maintains the testing program as a voluntary one. 
Thus, the test included only 85,3 percent of the districts. However, 
since testing was mandatory for all students within participating districts, 
presumably a '^sampling'* error would be restricted to the participating 
districts and not extend to the student population within the district. 

The Mississippi test used was administered in the spring of 1972 
(and thus the information is one year older than for most of our states). 
The mean and standard deviation of the test results were used to project 
the probability, to four decimal places, that students scored below the 
desired percentile cutoffs. These probabilities were then multiplied by 
the ADA figure for Mississippi for 1972-73 (491,800) to give the number 
of students who would qualify statewide if the results were the same as 
in the preceding year. 
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New Hampshire tests only a portion of its children, owing to a short- 
age of funds. In 1970 the state tested 8,467 6th-grade students, which 
is a larger and presumably bettor sample of its 13,000 6th-graders than 
the 6,400 tested in the fall of 1971. Thus, figures from the 1970 testing 
were utilized. 

The Stanford Achievement Test provides no ''Total Reading*' score 
(Level II, Form X, Word Meaning and Paragraph Meaning, were used). There- 
fore, when making use of this test, the scores for both the Word Meaning 
Tost and the Paragraph Meaning Test must be averaged to provide the best 
estimate of the total reading ability of children taking the test. 

Initially, the Anchor percentiles of 10, 15, 20, 25, 30, and 35 were 
converted to the SAT percentiles on Form X (the test used in the Anchor 
norming) . These SAT percentiles were then utilized in determining the 
proper grade score for 6th graders taking the Word Meaning and the Para- 
graph Meaning tests in the fall of the year. 

The grade scores were then converted to raw scores. From frequency 
distributions provided by the New Hampshire State Department of Education, 
the number of children scoring below each of these raw score cutoff points 
was found. The number of children in each category was then divided by 
the total number of children taking the test to find the percentage of 
children scoring below each cutoff point. This percentage for the Word 
Meaning test and for the Paragraph Moaning test were averaged. This 
averaging represents an estimate of the total percentage of children 
whose 'Total Reading" score falls below the Anchor criterion cutoff 
points, and when multiplied by the total New Hampshire enrollment for 
1972-73 of 155,300, yields the best estimate of the total number of Now 
Hampshire students who score below these criterion cutoff points. 
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Table D-33 

NEW MEXICO READING TEST RESULTS 
(All 5th Graders, October 1972) 



Anp Hr>T* 

Percen- 


U UX 1 biK^I^ b 

Percentile, 


Kaw 
Score, 


Students Scoring 
Below Raw Score 


Estimated Number 
of Low Achievers 
(percent x 


tile 


Q-2 


Fall 


Number 


Percent 


enrollment) 


10 


11 


25 


3,105 


13.06% 


36,381 


15 


16 


30 


5,500 


23.14 


64,443 


20 


21 


33 


6,672 


28.07 


78,176 


25 


26 


37 


7,870 


33.11 


92,213 


30 


31 


41 


9,467 


39.83 


110,925 


35 


35 


44 


10,791 


45.40 


126,438 



N Taking Test = 23,771 
New Mexico Enr. = 278,525 

The test used by New Mexico was the Comprehensive Tests of Basic 
Skills, Level 2, Form Q, The total numbers of 5th graders scoring in 
the Anchor 10th, 15th, 20th, 25th, 30th and 35th percentiles were deter- 
mined by adding the sum of the numbers of children in each of the New 
Mexico ethnic groups who scored in each percentile range, since the fre- 
quency distribution for the state's 5th graders as a whole was not 
available. 

With this exception, the New Mexico projections were conducted in 
the same manner as for other states: Anchor percentiles were converted 
to publisher's percentiles, the raw score corresponding to each publisher' 
percentile interval was determined, and the number of 5th graders scoring 
in each interval was found. Dividing this number by the total number of 
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5th graders taking the test (23,771) yields the percentage scoring in 
the interval. This percentage was then multiplied by the New Mexico 
enrollment for 1972-73 of the 278,525 to yield the projected number of 
New Mexico children falling below the respective Anchor percentile cut- 
off points. 
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Table D-34 



NORTH CAROLINA READING TEST RESULTS 
(All 6th Graders, Spring 1972) 



Anchor 
Percen- 
tile 

10 
15 
20 
25 
30 
35 



Publisher ' s 
Percentiles 

8.3 
12 
16 

20.6 
26 

30.7 



Percent 
of Students 
Raw Scoring Below 
Score Raw Score 

44 13 % 

48 20 

50 25 

53 31 

56 39 

58 42.5 



Estimated Number 
of Low Achievers 
(percent x 
enrollment) 

165,177 
254,119 
317,649 
393,884 
495,532 
540,003 



North Carolina used the Iowa Test of Basic Skills, Level II, Reading. 
The state tests its 6th grade students in the spring. The conversions 
were done in the manner of the other ITBS conversions. The state's enroll- 
ment in 1972-73 was 1,270,595. 
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Table D-35 



NORTH DAKOTA READING TEST RESULTS 
(All 5th Graders, Fall 1972) 



Anchor Publisher ' s 



Grade 



Students Scoring 



Estimated Number 
of Low Achievers 



Percen- 
tile 


Perc entile, 
II-5 


Equiva- 
lent 


Below 
Number 


Raw Score 
Percent 


(percent x 
enrollment) 


10 


5.5 


2.9 


145 


4.38 


5,979 


15 


11 


3.3 


313 


9.47 


12,928 


20 


15.3 


3.5 


413 


12.50 


17,065 


25 


19.3 


3.7 


467 


14.13 


19,290 


30 


25.3 


4.05 


664 


20.10 


27,440 


35 


30 


4.3 


841 


25.46 


34,757 


North 


Dakota tested 


3,303 5th 


grade 


students in 


the fall of 1972 



using ihu Iowa Tests of Basic Skills, Form 5, Level II, Reading, This 
is the same for for which grade 5 was standardized in che Anchor Study, 
so the only shift necessary was to go from the Anchor spring norms to the 
ITBS fall norms. The same assumption was made here a:: in earlier analyses: 
that the differences between the Anchor percentiles and the publisher's 
percentiles would be the same had they been done in the fall as in the 
spring. The state's enrollment was 136,518. 
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Table D-36 



RHODE ISLAND READING TEST RESULTS 
(All 4th Graders, Fall 1972) 



Anchor 
Porcen- 

lilo 



Publisher ' s 
Percentiles 



Grade 
Score 



Students Scoring 
Below 
Grade Score 
Number Percent 



Estimated Number 
of Low Achievers 
(percent x 
enrollment) 



10 


7 


22 


1319 


17.92 


32,256 


15 


14 


26 


1885 


25.60 


46,080 


20 


19 


28 


1894 


25.72 


46,296 


25 


23 


30 


1916 


26.02 


46,836 


30 


27 


32 


1951 


26.50 


47,700 


35 


33 


34 


2595 


35.24 


63,432 



Rhode Island conducts its statewide testing program on 4th and 8th 
graders. Our analysis is concerned with the 4th-grade testing using the 
Iowa Tests of Basic Skills, Form S, Reading, administered in the fall of 
1972. At that time 7,362 students were tested out of the enrollment of 
180,000. The analysis is comparable to the one carried out for the North 
Dakota ITBS. 
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Table 0-37 



TENNESSEE READING TEST RESULTS 
(All 5th Graders, Fall 1970) 



Percent of 

Students Estimated Number 

Anchor Scoring Below of Low Achievers 

Percen- Metropolitan Standard Standard (percent X 

tile Percentile Score Score enrollments) 

10 10 56 10% 84,500 

15 14 59 17 143,650 

20 . 18 61 22 185,900 

25 24.5 64 31 261,950 

30 30 67 39 329,550 

35 34 69 45 380,250 



Tennessee tested its 5th graders using the Metropolitan Achievement 
Test — 1970 edition. Form F — and its statewide test distribution was ana- 
lyzed in a manner very similar to other states. The Anchor percentiles 
have been coverted to corresponding Metropolitan percentiles, which were 
then used to find the standard score cutoffs for 5th graders tested in 
the fall. From these standard scores, the percent of Tennessee 5th graders 
scoring below the criterion cutoff points was determined, and then multi- 
plied by the total Tennessee enrollment for 1972-73 of 845,000. 
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